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METASTATIC BREAST AND COLON CANCER REGULATED GENES 

TECHNICAL FIELD OF THE INVENTION 

This invention relates to methods for predicting the behavior of tumors. 
More particularly, the invention relates to methods in which a tumor sample is 
5 examined for expression of a specified gene sequence thereby to indicate propensity for 
metastatic spread. 

BACKGROUND OF THE INVENTION 

Breast cancer is one of the most common malignant diseases in women, 
with about 1,000,000 new cases per year worldwide. Colon cancer is another of the 

10 most common cancers. Despite use of a number of histochemical, genetic, and 
immunological markers, clinicians still have a difficult time predicting which tumors 
will metastasize to other organs. Some patients are in need of adjuvant therapy to 
prevent recurrence and metastasis and others are not. However, distinguishing between 
these subpopulations of patients is not straightforward, and course of treatment is not 

15 easily charted. There is a need in the art for new markers for distinguishing between 
tumors which will or have metastasized and those which are less likely to metastasize 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide markers for 
distinguishing between tumors which will or have metastasized and those which are less 
20 likely to metastasize. These and other objects of the invention are provided by one or 
more of the embodiments described below. 

One embodiment of the invention provides an isolated and purified 
human protein having an amino acid sequence which is at least 85% identical to an 
amino acid sequence encoded by a nucleotide sequence selected from the group 
25 consisting of SEQ ID NOS: 1 -63 or the complement thereof. 

Another embodiment of the invention provides a fusion protein which 
comprises a first protein segment and a second protein segment fused to each other by 
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means of a peptide bond. The first protein segment consists of at least six contiguous 
amino acids selected From an amino acid sequence encoded by a nucleotide sequence 
selected from the group consisting of SEQ ID NOS: 1-63 or the complement thereof 

Yet another embodiment of the invention provides an isolated and 
. 5 puriil^u polypeptide consisting of at least six contiguous amino acids of a human 
protein having an amino acid sequence encoded by a nucleotide sequence selected from 
the group consisting of SEQ ID M0S:1-6j or the complement thereof 

Still another embodiment of the invention provides a preparation of 
antibodies which specifically bind to a human protein which comprises an amino acid 
10 sequence encoded by a nucleotide sequence selected from the group consisting of SEQ 
ID NOS:l-63 or the complement thereof 

Even another embodiment of the invention provides an isolated and 
purified subgenomic polynucleotide comprising at least 1 1 contiguous nucleotides of a 
nucleotide sequence which is at least 96% identical to a nucleotide sequence selected 
15 from the group consisting of SEQ ID NOS:l-63 or the complement thereof 

Another embodiment of the invention provides an isolated and purified 
uene which comprises a coding sequence comprising a nucleotide sequence selected 
from the group consisting of SEQ ID NOS:l-63 or the complement thereof 

Yet another embodiment of the invention provides a method tor 
20 determining metastasis in a tissue sample. An expression product of a gene which 
comprises a coding sequence selected from the group consisting of SEQ ID NOS:l. 2. 
4, 5, 9, 11- 13. 14, 18, 19. 20. 22, 24, 26, 29, 30, 33. 35, 36, 38-41. 45, 48, 52, 55. 57. 
58, 60, 63-66, 69-74, 76, 80, 82, and 83 is measured in a tissue sample. A tissue sample 
which expresses the product is categorized as metastatic. 
25 Still another embodiment of the invention provides a method for 

determining metastasis in a tissue sample. An expression product of a gene which 
comprises a sequence selected from the group consisting of SEQ ID N0S:3, 7. 8, 10. 
12. 15-17. 21. 23. 28, 31. 34, 37. 42-44. 46, 47. 49-51. 53, 59, 61. 62, 67. 68, 75, 77-79. 
81, 84. and 85 is measured in a tissue sample. A tissue sample which does not express 
30 the product is categorized as metastatic. 
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Even another embodiment of the invention provides a method for 
determrning metastatic potential in a tissue sample. An expression product of a gene 
v/hich comprises a sequence selected from the group consisting of SEQ ID NOS:U 2, 
4, 5, 9, 1 1. 13, 14, 18, 19, 20, 22. 24, 26, 29, 30, 33, 35, 36, 38-41,. 45, 48. 52, 55, 57, 
58, 60, 63-66. 69-74, 76, 80, 82, and 83 is measured in a tissue sample. A tissue sample 
which expresses the product is categorized as having metastatic potential. 

A further embodiment of the invention provides a method for 
determ.ining metastatic potential in a tissue sample. An expression product of a gene 
which comprises a sequence selected from the group consisting of SEQ ID N0S;3. 1, 
8, 10, 12, 15-17, 21, 23, 28, 31, 34, 37, 42-44, 46. 47, 49-51, 53, 59, 61, 62, 67, 68. 75, 
77-79, 81. 84. and 85 is measured in a tissue sample. A tissue sample which does not 
express the product is categorized as having metastatic potential. 

Another embodiment of the invention provides a method of predicting 
the propensity for metastatic spread of a breast tumor preferentially lo bone or lung. An 
expression product of a gene which comprises a sequence selected from the group 
consisting of SEQ ID NO: 1 , 5. 1 1 , 1 8, 20, 22, 24, 30. 33,35, 36, 38. 45, 52, 58, 65. 66, 
70, 74, 70. and 80 is measured in a breast tumor sample. A breast tumor sample which 
expresses the product is categorized as having a propensity to metastasize lo bone or 

lung. i 

Even another embodiment of the invention provides a method of 
predicting propensity for metastatic spread of a breast tumor preferentially to lung. An 
expression product of a gene which comprises a sequence selected from the group 
consisting of SEQ ID N0S:2, 4, 9, 13, 14, 19. 26. 29, 39-41, 48, 55. 57, 60, 63. 64, 72, 
73, 82, and 83 is measured in a breast tumor sample. A breast tumor sample which 
expresses the product is characterized as having a propensity to metastasize to lung. 

Still another embodiment of the invention provides a method of 
predicting propensity for metastatic spread of a colon tumor. An expression product ot 
a gene which comprises the nucleotide sequence shown in SEQ ID NO:56 is measured 
in a colon tumor sample. A colon tumor sample which expresses the product is 
characterized as having a low propensity to metastasize. 
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Even another embodiment of the invention provides a method for 
determining metastasis in a tissue sample. An expression product of a gene which 
comprises a coding sequence selected from, the group consisting of SEQ ID NOSzS, 7, 
8, 10, 12, 15-17. 21, 23, 25, 28, 31, 34. 37, 42-44, 46, 47, 49, 61. 62, 67, 68. 75, 77-79. 
5 81, 84, and 85 is measured in a tissue sample. A tissue sample which expresses the 
product is categorized as non-metastatic. 

Yet another embodiment of the invention provides a method for 
determining metastasis in a tissue sample. An expression product of a gene which 
comprises a coding sequence selected from the group consisting of SEQ ID N0S:3, 7, 
0 8, 10, 12, 15-17, 21, 23, 25, 28, 31, 34, 37, 42-44, 46, 47, 49, 6L 62, 67, 68. 75, 77-79.' 
8i, 84, and 85 is measured in a tissue sample. A tissue sample which does not express 
the product is categorized as metastatic. 

The invention thus provides the art with a number of genes and proteins, 
which can be used as markers of metastasis. These are useful for more rationally 
5 prescribing the course of therapy for breast or colon cancer patients. 

DETAILED DESCRIPTION 

It is a discovery of the present invention that a number of genes are 
differentially expressed between metastatic cancer cells, especially cancer cells of the 
breast and colon, and non-metastatic cancer cells. These genes are metastatic marker 
genes. This information can be utilized to make diagnostic reagents specific for the 
expression products of the differentially expressed genes. It can also be used in 
diagnostic and prognostic methods which will help clinicians in planning appropriate 
treatment regimes for cancers, especially of the breast or colon. 

Some of the polynucleotides disclosed herein represent novel genes 
which are differentially expressed between non-metastatic cancer cells and cancer cells 
which have a potential to metastasize. SEQ ID NOS:l-63 represent novel metastatic 
marker genes (Table 1). SEQ ID NOS:64-85 represent known genes which have been 
found to be differentially expressed in metastatic relative to non-metastatic cancer cells 
(Table 2). Some of the metastatic marker genes disclosed herein are expressed in 
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metastatic cells relative to non-metastatic cells, particularly in breast cancer cells which 
metastasize to bone and lung (SEQ ID N0S:1. 5, 1 1, 18, 20, 21. 24, 30, 33, 35, 36, 38, 
45, 52, 58/65, 66, 70, 74, 76, and 80). One metastatic marker gene (SEQ ID NO:56) is 
expressed in non-metastatic breast cancer cells and in colon cancer cells with low 
5 metastatic potential. Other metastatic marker genes are expressed in metastatic cancer 
cells, particularly in breast cancer cells which metastasize only to lung (SEQ ID N0S:2, 
4, 9, 13, 14, 19, 26, 29, 39-4L 48, 55, 57, 60, 63, 64, 72, 73, 82, and 83). Still other 
metastatic marker genes (SEQ IDN0S:3, 7, 8, 10, 12, 15-17, 21, 23, 28, 31, 34, 37, 42- 
44, 46, 47, 49, 61, 62, 67, 68, 75, 77-79, 81, 84, and 85) are expressed in cancer cells 
10 which do not typically metastasize, particularly in breast cancer cells. Identification of 
these relationships and markers permits the formulation of reagents and methods as 
further described below. Other metastatic marker genes, such as those which comprise 
a nucleotide sequence shown in SEQ ID N0S:6, 27, 32, and 54, can be used to identify 
cancerous tissue, particularly breast cancer tissue. 
15 Sequences of metastatic marker genes are disclosed in SEQ ID N0S:1- 

85. Metastatic marker proteins can be made by expression of the disclosed 
polynucleotide molecules. Amino acid sequences encoded by novel polynucleotides of 
the invention can be predicted by running a translation program for each of three 
reading frames for a disclosed sequence and its complement. Complete polynucleotide 
20 sequences can be obtained by chromosome walking, screening of libraries for 
overlapping clones, 5' RACE, or other techniques well known in the art. 

Reference to metastatic marker nucleotide or amino acid sequences 
includes variants which have similar expression patterns in metastatic relative to non- 
metastatic cells, as described below. Metastatic marker polypeptides can differ in 
25 length from full-length metastatic marker proteins and contain at least 6, 8, 10, 12, 15, 
18. 20, 25. 30, 35, 40. 45. 50. 55, 60, 65. 70. 75. 80. 85, 90. 95. 100. 120, 140, 160, 180. 
or 200 or more contiguous amino acids of a metastatic marker protein. 

Variants of marker proteins and polypeptides can also occur. Metastatic 
marker protein or polypeptide variants can be naturally or non-naturally occurring. 
30 Naturally occurring metastatic marker protein or polypeptide variants are found in 
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humans or other species and comprise amino acid sequences which are substantially 
identical to the proteins encoded by genes corresponding to the nucleotide sequences 
shown in SEQ ID NOS:l-85 or their complements. Non-naturally occurring metastatic, 
marker protein or polypeptide variants which retain substantially the same differential 
5 expression patterns in metastatic relative to non-metastatic cancer cells as naturally 
occurring metastatic marker protein or polypeptide variants are also included here. 
Preferably, naturally or non-naturally occurring metastatic marker protein or 
polypeptide variants have amino acid sequences which are at least 85%, 90%, or 95% 
identical to amino acid sequences encoded by the nucleotide sequences shown in SEQ 

10 ID NOS:l-85. More preferably, the molecules are at least 98% or 99% identical. 
Percent sequence identity between a wild-type protein or polypeptide and a variant is 
determined by aligning the wild-type protein or polypeptide with the variant to obtain 
the greatest number of amino acid matches, as is known in the art, counting the number 
of amino acid matches between the wild-type and the variant, and dividing the total 

15 number of matches by the total number of amino acid residues of the wild-type 
sequence. 

Preferably, amino acid changes in metastatic marker protein or 
polypeptide variants are conservative amino acid changes, Le., substitutions of similarly 
charged or uncharged amino acids. A conserx'ative amino acid change involves 

20 substitution of one of a family of amino acids which are related in their side chains. 
Naturally occurring amino acids are generally divided into four families: acidic 
(aspartate, glutamate), basic (lysine, arginine, histidine), non-polar (alanine, valine, 
leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), and uncharged 
polar (glycine, asparagine, glulamine, cystine, serine, threonine, tyrosine) amino acids; 

25 Phenylalanine, tryptophan, and tyrosine are sometimes classified jointly as aromatic 
amino acids. 

It is reasonable to expect that an isolated replacement of a leucine with 
an isoleucine or valine, an aspartate with a glutamate. a threonine with a serine, or a 
similar replacement of an amino acid with a structurally related amino acid will not 
30 have a major effect on the biological properties of the resulting metastatic marker 
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protein or polypeptide variant. Properties and functions of metastatic marker protein or 
polypeptide variants are of the same type as a metastatic marker protein or polypeptide 
comprising amino acid sequences encoded by the nucleotide sequences shown in SEQ 
ID NOS:l-85, although the properties and functions of variants can differ in degree. 

5 Whether an amino acid change results in a metastatic marker protein or polypeptide 
variant with the appropriate differential expression pattern can readily be determined. 
For example, nucleotide probes can be selected from the marker gene sequences 
disclosed herein and used to detect mjarker gene mRNA in Northern blots or in tissue 
sections, as is known in the art. Ahematively, antibodies which specifically bind to 

10 protein products of metastatic marker genes can be used to detect, expression of 
metastatic marker proteins. 

Metastatic marker variants include glycosylated forms, aggregative 
conjugates with other molecules, and covalent conjugates with unrelated chemical 
moieties. Metastatic marker variants also include allelic variants, species variants, and 

15 muteins. Truncations or deletions of regions which do not affect the differential 
expression of metastatic marker genes are also metastatic marker variants. Covalent 
variants can be prepared by linking functionalities to groups which are found in the 
amino acid chain or at the N- or C-terminal residue, as is known in the art. 

Full-length metastatic marker proteins can be extracted, using standard 

20 biochemical methods, from metastatic marker protein-producing human cells, such as 
metastatic breast or colon cancer cells. An isolated and purified metastatic marker 
protein or polypeptide is separated from other compounds which normally associate 
with a metastatic marker protein or polypeptide in a cell such as certain proteins, 
carbohydrates, lipids, or subcellular organelles. A preparation of isolated and purified 

25 metastatic marker proteins or polypeptides is at least 80% pure; preferably, the 
preparations are 90%. 95%, or 99% pure. 

Metastatic marker proteins and polypeptides can also be produced by 
recombinant DNA methods or by synthetic chemical methods. For production of 
recombinant metastatic marker proteins or polypeptides, coding sequences selected 

30 from the nucleotide sequences shown in SEQ ID NOS:l-85, or variants of those 
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sequences which encode metastatic marker proteins, can be expressed in known 
prokaryotic or eukaryotic expression systems (see below). Bacterial, yeast, insect, or 
mammalian expression systems can be used, as is known in the art. 

Ahematively, synthetic chemical methods, such as sohd phase peptide 

5 synthesis, can be used to synthesize a metastatic marker protein or polypeptide. 
General means for the production of peptides, analogs or derivatives are outlined in 
Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins - A 
Survey of Recent Developments, Weinstein, B. ed.. Marcell Dekker, Inc., publ.. 
New York (1983). Moreover, substitution of D-amino acids for the normal L- 

10 stereoisomer can be carried out to increase the half-life of the molecule. Metastatic 
marker variants can be similarly produced. 

Non-naturally occurring fusion proteins comprising at least 6, 8, 10, 12, 
15, 18, 20, 25, 30, 35,40, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 120, 140, 160, 180, 
or 200 or more contiguous metastatic marker amino acids can also be constructed. 

15 Human metastatic marker fusion proteins are useful for generating antibodies against 
metastatic marker amino acid sequences and for use in various assay systems: For 
example, metastatic marker fusion proteins can be used to identify proteins which 
interact with metastatic marker proteins and influence their functions. Physical 
methods, such as protein affinity chromatography, or library-based assays for protein- 

20 protein interactions, such as the yeast two-hybrid or phage display systems, can also be 
used for this purpose. Such methods are well known in the art and can also be used as 
drug screens. 

A metastatic marker fusion protein comprises two protein segments 
fused together by means of a peptide bond. The first protein segment comprises at least 

25 6, 8, 10. 12, 15. 18, 20, 25, 30, 35, 40, 50, 55. 60. 65. 70, 75, 80. 85, 90, 95, 100. 120, 
140, 160, 180, or 200 or more contiguous amino acids of a metastatic marker protein. 
The amino acids can be selected from the amino acid sequences encoded by the 
nucleotide sequences shown in SEQ ID NOS:l-85 or from variants of those sequences, 
such as those described above. The first protein segment can also comprise a full- 

30 length metastatic marker protein. 
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The second protein segment can be a full-length protein or a protein 
fragment or polypeptide. The fusion protein can be labeled with a detectable marker, as 
is known in the art, such as a radioactive^ fluorescent^ chemiluminescent^ or biotinylated 
marker. The second protein segment can be an enzyme which will generate a detectable 
5 product, such as P-galactosidase. The first protein segment can be N-terminal or C- 
terminal, as is convenient. 

Techniques for making fusion proteins, either recombinantly or by 
covalently linking two protein segments, are also well known. Recombinant DNA 
methods can be used to prepare metastatic marker fusion proteins, for example, by 
10 making a DNA construct which comprises coding sequences selected from SEQ ID 
NOS:l-85 in proper reading frame with nucleotides encoding the second protein 
segment and expressing the DNA construct in a host cell, as described below. 

Isolated and purified metastatic marker proteins, polypeptides, variants, 
or fusion proteins can be used as imrnunogens, to obtain preparations of antibodies 
15 which specifically bind to a metastatic marker protein. The antibodies can be used, 
inter olio, to detect wild-type metastatic marker proteins in human tissue and fractions 
thereof. The antibodies can also be used to detect the presence of mutations in 
metastatic marker genes which result in under- or over-expression of a metastatic 
marker protein or in expression of a metastatic marker protein with altered size or 
20 electrophoretic mobility. 

Preparations of polyclonal or monoclonal antibodies can be made using 
standard methods. Single-chain antibodies can also be prepared. Single-chain 
antibodies which specifically bind to metastatic marker proteins, polypeptides, variants, 
or fusion proteins can be isolated, for example, from single-chain immunoglobulin 
25 display libraries, as is known in the art. The library is '^panned'" against metastatic 
marker protein amino acid sequences, and a number of single chain antibodies which 
bind with high-affinity lo different epitopes of metastatic marker proteins can be 
isolated. Hayashi et al,, 1995. Gene 760:129-30. Single-chain antibodies can also be 
constructed using a DNA amplification method, such as the polymerase chain reaction 
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(PGR), using hybridoma cDNA as a template. Thirion ct aL. 1996, Eur. J. Cancer 
Prev. 5:507-1 1, 

Single-chain antibodies can be mono- or bispecific, and can be bivalent 
or tetravalent. Construction of tetravalent, bispecific single-chain antibodies is taught in 
5 Coloma and Morrison, 1997, Nat. BiofechnoL /i: 159-63, Construction of bivalent, 
bispecific single-chain antibodies is taught in Mallender and Voss, 1994, ./ Biol. Chent 
269:199-206. 

A nucleotide sequence encoding the single-chain antibody can be 
constructed using manual or automated nucleotide synthesis, cloned into DNA 

10 expression constructs using standard recombinant DNA methods, and introduced into 
cells which express the coding sequence, as described below. Alternatively, single- 
chain antibodies can be produced directly using, for example, filamentous phage 
technology. Verhaar e/ aL, 1995, Ini. 1 Cancer 67:497-501 ; Nicholls e/ uL, 1993, ./ 
ImnmnoL Mc(h. 765:81-91, 

15 Metastatic marker-specific antibodies specifically bind to epitopes 

present in a full-length metastatic marker protein having an amino acid sequence 
encoded by a nucleotide sequence shown in SEQ ID NOS:l-85, to metastatic marker 
polypeptides, or to metastatic marker variants, either alone or as part of a fusion protein. 
Preferabl)'. metastatic marker epitopes are not present in other humdn proteins. 

20 Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to form an 
epitope. However, epitopes which involve non-contiguous amino acids may require 
more, e.g., at least 15, 25, or 50 amino acids. 

Antibodies which specifically bind to metastatic marker proteins, 
polypeptides, fusion proteins, or variants provide a detection signal at least 5-, 10-. or 

25 20-fold higher than a detection signal provided with other proteins when used in 
Western blots or other immunochemical assays. Preferably, antibodies which 
specifically bind to metastatic marker epitopes do not detect other proteins in 
immunochemical assays and can immunoprecipitaie a metastatic marker protein, 
polypeptide, fusion protein, or variant from solution. 
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Antibodies can be purified by methods well known in the art. 
Preferably, the antibodies are affinity purified, by passing the antibodies over a column 
to which a metastatic marker protein, polypeptide, variant or fusioa protein is bound. 
The bound antibodies can then be eluted from the column, for example, using a buffer 

5 with a high salt concentration. 

Subgenomic polynucleotides contain less than a whole chromosome. 
Preferably, the polynucleotides are intron-free. In a preferred embodiment, the 
polynucleotide molecules comprise a contiguous sequence of 10. II, 12, 15, 20, 25, 30, 
32, 35, 40, 45, 50, 60, 70, 74, 80, 90, 100, 125, 150, 154, 175. 182, 200, 243, or 268 

10 nucleotides selected from SEQ ID NOS:l-85 or the complements thereof The 
complement of a nucleotide sequence shown in SEQ ID NOS:l-85 is a contiguous 
nucleotide sequence which forms Watson-Crick base pairs with a contiguous nucleotide 
sequence shown in SEQ ID NOS:l-85. The complement of a nucleotide sequence 
shown in SEQ ID NOS:l-85 (the antisense strand) is also a subgenomic polynucleotide, 

!5 and can he used provide marker protein antisense oligonucleotides. Double-stranded 
polynucleotides which comprise one of the nucleotide sequences shown in SEQ ID 
NOS:l-85 are also subgenomic polynucleotides. Metastatic marker protein 
subgenomic polynucleotides also include polynucleotides which encode metastatic 
marker protein-specific single-chain antibodies and ribozymes. or fusion proteins 

20 comprising metastatic marker protein amino acid sequences. 

Degenerate nucleotide sequences encoding amino acid sequences of 
metastatic marker protein and or variants, as well as homologous nucleotide sequences 
which are at least 85%, 90%, 95%, 98%, or 99% identical to the nucleotide sequences 
shown in SEQ ID NOS:l-85, are also metastatic marker subgenomic polynucleotides. 

25 Typically, homologous metastatic marker subgenomic polynucleotide sequences can be 
confirmed by hybridization under stringent conditions, as is known in the an. ^Pe^cent 
sequence identity between wild-type and homologous variant sequences is determined 
by aligning the wild-type polynucleotide with the variant to obtain the greatest number 
of nucleotide matches, as is known in the art. counting the number of nucleotide 

30 matches between the wild-type and the variant, and dividing the total number of 
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matches by the total number of nucleotides of the wild-type sequence. A preferred 
algorithm for calculating percent identity is the Smith- Waterman homology search 
algorithm as implemented in MPSRCH program (Oxford Molecular) using an affine 
gap search with the following search parameters: gap open penalty of 10, and gap 
extension penalty of 1 . 

Metastatic marker subgenomic polynucleotides can be isolated and 
purified free from other nucleotide sequences using standard nucleic acid purification 
techniques. For example, restriction enzymes and probes can be used to isolate 
polynucleotide fragments which comprise nucleotide sequences encoding a metastatic 
marker protein. Isolated and purified subgenomic polynucleotides are in preparations 
which are free or at least 90% free of other molecules. 

Complementary DNA molecules which encode metastatic marker 
proteins can be made using reverse transcriptase, with metastatic marker mRNA as a 
template. The polymerase chain reaction (PGR) or other amplification techniques can 
be used to obtain metastatic marker subgenomic polynucleotides, using either human 
genomic DNA or cDNA as a template, as is known in the art. Alternatively, synthetic 
chemistry techniques can be used to synthesize metastatic marker subgenomic 
polynucleotides which comprise coding sequences for regions of metastatic marker 
proteins, single-chain antibodies, or ribozymes. or which comprise antisense 
oligonucleotides. The degeneracy of the genetic code allows alternate nucleotide 
sequences to be synthesized which will encode a metastatic marker protein comprising 
amino acid sequences encoded by the riucleotide sequences shown in SEQ ID N0S:1- 



85. 



Purified and isolated metastatic marker subgenomic polynucleotides can 
be used as primers to obtain additional copies of the polynucleotides or as probes for 
identifying wild-type and mutant metastatic marker protein coding sequences. 
Metastatic marker subgenomic polynucleotides can be used lo express metastatic 
marker mRNA, protein, polypeptides, or fusion proteins and to generate metastatic 
marker antisense oligonucleotides and ribozymes. 



wo 0B/22!30 



13 



PCT/LIS99/24222 



A metastatic marker subgenomic polynucleotide comprising metastatic 
marker protein coding sequences can be used in an expression construct. Preferably, the 
metastatic marker subgenomic polynucleotide is inserted into an expres^on plasmid 
(for example, the Ecdyson system, pIND, In Vitro Gene). Metastatic marker 
5 subgenomic polynucleotides can be propagated in vectors and cell lines using 
techniques well known in the art. Metastatic marker subgenomic polynucleotides can 
be on linear or circular molecules. They can be on autonomously replicating molecules 
or on molecules without replication sequences. They can be regulated by their own or 
by other regulatory sequences, as are known in the art. 

10 A host cell comprising a metastatic marker expression construct can 

then be used to express all or a portion of a metastatic marker protein. Host cells 
comprising metastatic marker expression constructs can be prokaryotic or eukaryotic. 
A variety, of host cells are available for use in bacterial, yeast, insect, and human 
expression systems and can be used to express or to propagate metastatic marker 

15 expression constructs (see below). Expression constructs can be introduced into host 
cells using any technique known in the art.. These techniques include transferrin- 
polycation-mediated DNA transfer, transfection with naked or encapsulated nucleic 
acids, liposome-mediated cellular fusion, intracellular transportation of DNA-coated 
latex beads, protoplast fusion, viral infection, eleciroporation, and calcium phosphate- 

20 mediated transfection. 

A metastatic marker expression construct comprises a promoter which is 
functional in a chosen host cell. The skilled artisan can readily select an appropriate 
promoter from the large number of cell type-specific promoters known and used in the 
art. The expression construct can also contain a transcription terminator which is 

25 functional in the host cell. The expression construct comprises a polynucleotide 
segment which encodes all or a portion of the metastatic marker protein, variant, fusion 
protein, antibody, or ribozyme. The polynucleotide segment is located downstream 
from the promoter. Transcription of the polynucleotide segment initiates at the 
promoter. The expression construct can be linear or circular and can contain sequences, 

30 if desired, for autonomous replication. 
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Bacterial systems for expressing metastatic marker expression constructs 
include those described in Chang el al. Nature (1978) 2^5: 615. Goeddel et ai. Nature 
(1979) 281: 544, GoeddeL et al. Nucleic Acids Res. (19S0) .V: 4057, £P 36,776, U.S. 
4.551.433. deBoer el ai. Proc. Nat'l Acad. Sci. USA (1983) SO: 21-25, and Siebeniist ci 

5 ai. Cell {mO) 20:269. 

Expression systems in yeast include those described in Hinnen el ai. 
Proc. Nal'IAcad Sci. USA .{]9n) 73: 1929; Ito el ai. J. Bacieriol. (1983) 153: 163; 
Kunz el ai, Moi Cell. Bioi (1986) 6: 142; Kunze et ai. .1. Basic Microbiol. (1985) 25: 
141; Gleeson et ai. ./. Gen. Microbiol. (1986) 132: 3459. Roggenkamp el ai, Mol Gen. 

10 Genet. (1986) 202 :302) Das et ai, J. Bacterioi (1984) 755: 1 165; De Louvencourt et 
ai, ./. Bacieriol. (1983) J54: 737, Van den Berg el ai. Bio/Technoloi^' (1990) 8: 135: 
Kunze el ai. ./. Basic Microbiol (1985) 25: 141; Gregg et al. Mol. Cell Biol (1985) 5: 
3376, U.S. 4.837,148, US 4,929,555; Beach and Nurse. Nature (1981) 300: 706; 
Davidow el al. Cwr. Genet. (1985) W: 380, Gaillardin ei al. Curr Genet. (1985) 10: 

15 49. Ballance el al. Biochem. Biophys. Res. Convnun. (1983) 112: 284-289; Tilburn et 
al. Gene (1983) 26: 205-221. Yellon et al. Proc. Nat'l Acad. Sci. USA (1984) 81: 
1470-1474. Kelly and Hynes. EMBO ./. (1985) 4: 475479: EP 244.234. and WO 
91/00357. 

Expression of n^etastatic marker expression constructs in insects can be 
20 carried out as described in U.S. 4,745.051, Friesen el al (1986) "The Regulation of 
Baculovirus Gene Expression" in: The Molecular Biology of Baculoviruses (W. 
Doerfler. ed.). EP 127,839, EP 155,476. and Vlak et al. J. Gen. Virol (1988) 69: 765- 
776, Miller el al. Ann. Rev. Microbiol (1988) 42: 177. Carbonell et al. Gene (1988) 
73: 409. Maeda et al. Nature (1985) 315: 592-594, Lebacq-Verheyden et al. Mol Cell 

25 Biol (1988) 8: 3129; Smith et al. Proc. Nat'l Acad Sci. USA (1985) 82: 8404. 
Miyajima et al. Gene (1987) 58: 273; and Martin ei al. DNA (1988) 7.99. Numerous 
baculoviral strains and variants and corresponding permissive insect host cells from 
hosts are described in Luckow et al. Bio/Technology {\9m 6: 47-55. Miller et al. in 
Genetic Engineering (Setlow, J.K. et al. eds.). Vol. 8 (Plenum Publishing, 1986), pp. 

30 277-279. and Maeda et al. Nature. (1985) 315: 592-594. 
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Mammalian expression of metastatic marker expression constructs can 
be achieved as described in Dijkema ef ai. EMBO ./. (1985) 4\ 761, Gorman et al., 
Proc. Nan Acad Scr USA (1982b) 79: 6777, Boshart e( uL Cell (1985) 41: 521 and 
U.S. 4,399.216. Other features of mammalian expression of melastalic marker 
5 expression constructs can be facilitated as described in Ham and Wallace, Meih. Eni. 
(1979) 5<S: 44, Barnes and Sato, Anal. Biochem. (1980) 102: 255, U.S. 4,767,704, US 
4,657,866, US 4,927,762, US 4.560,655, WO 90/103430, WO 87/00195, and U.S. RE 
30,985, 

Subgenomic polynucleotides of the invention can also be used in gene 
10 delivery vehicles, for the purpose of delivering a metastatic marker mRNA or 
oligonucleotide (either with the sequence of native metastatic marker mRNA or its 
complement), full-length metastatic marker protein, metastatic marker fusion protein, 
metastatic marker polypeptide, or metastatic marker-specific ribozyme or single-chain 
antibody, into a cell preferably a eukaryotic cell. According to the present invention, a 
15 gene delivery vehicle can be. for example, naked plasmid DNA, a viral expression 
vector comprising a metastatic marker subgenomic polynucleotide, or a metastatic 
marker subgenomic polynucleotide in conjunction with a liposome or a condensing 
agent. 

In one embodiment of the invention, the gene delivery vehicle comprises 
20 a promoter and a metastatic marker subgenomic polynucleotide. Preferred promoters 
are tissue-specific promoters and promoters which are activated by cellular 
proliferation, such as the thymidine kinase and thymidylate synthase promoters. Other 
preferred promoters include promoters which are activatable by infection with a virus, 
such as the a- and (3-interferon promoters, and promoters which are activatable by a 
25 hormone, such as estrogen. Other promoters which can be used include the Moloney 
virus LTR. the CMV promoter, and the mouse albumin promoter, 

A metastatic marker gene delivery vehicle can comprise viral sequences 
such as a viral origin of replication or packaging signal. These viral sequences can be 
selected from viruses such as astrovirus, coronavirus. orthomyxovirus, papovavirus. 
30 paramyxovirus, parvovirus, picomavirus, poxvirus, retrovirus, togavirus or adenovirus. 
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In a preferred embodiment, the metastatic marker gene delivery vehicle is a 
recombinanl retroviral vector. Recombinant retroviruses and various uses thereof have 
been described in numerous references including, for example. Mann el al.. Cell 
55; 153, 1983. Cane and Mulligan, Proc. Nat 7 Acad. Sci. USA 8J:6jA9. 1984, Miller el 
al., Human Gene Therapy 1:5-14, 1990, U.S. Patent Nos. 4.405,712, 4,861,719, and 
4.980.289. and PCT Application Nos. WO 89/02.468. WO 89/05.349, and WO 
90/02,806. Numerous retroviral gene delivery vehicles can be utilized in the present 
invention, including for example those described in EP 0.415,731: WO 90/07936; WO 
94/03622; WO 93/25698; WO 93/25234; U.S. Patent No. 5,219.740; WO 9311230; 
WO 9310218; Vile and Hart. Cancer Res. 55:3860-3864. 1993; Vile and Hart. Cancer 
Res. 55:962-967, 1993; Ram ei al.. Cancer Res. 55:83-88. 1993; Takamiya el al., J. 
Ncurosci. Re.s. 55:493-503, 1992; Baba ef al.. J. Newosur,z 79:729-735. 1993 (U.S. 
Patent No. 4.777,127, GB 2,200,651, EP 0.345.242 and W09 1/02805). 

Particularly preferred retroviruses are derived from retroviruses which 
include avian leukosis virus (ATCC Nos. VR-535 and VR-247). bovine leukemia virus 
(VR-1315). murine leukemia virus (MLV), mink-cell focus-inducing virus (Koch ei al., 
.;. Vir. ■/y:828. 1984; and Oliff f/ al.. J. Vir. -/(V:542. 1983). murine sarcoma virus 
(ATCC Nos. VR-844. 45010 and 45016), reticuloendolheliosis virus (ATCC Nos VR- 
994. VR-770 and 45011). Rous sarcoma virus. Mason-Pfizer monkey virus, baboon 
endogenous virus, endogenous feline retrovirus (e.^,'.. RD114). and mouse or rat gL30 
sequences used as a retroviral vector. Particularly preferred strains of MLV from v^'hich 
recombinant retroviruses can be generated include 4070A and 1504A (Hartley and 
Rowe, ./. Vir. 79:19, 1976). Abelson (ATCC No. VR-999). Friend (ATCC No. VR- 
245), Graffi (Ru et al.,J. Vir. (57:4722, 1993; and Yanlchev Neoplasma 2(3:397, 1979). 
Gross (ATCC No. VR-590), Kirsten (Albino ei al.. J. Exp. Med J64:]7\0, 1986), 
Harve>- sarcoma virus (Manly el al.. J. Vir. 62:3540. 1988; and Albino el al., J. Exp. 
Med 164:\1\Q, 1986) and Rauscher (ATCC No. VR-998). and Moloney MLV (ATCC 
No. VR-190). A particularly preferred non-mouse reiroviais is Rous sarcoma virus. 
Preferred Rous sarcoma viruses include Bratislava (Manly ei ai.J. Vir. (52:3540, 1988; 
and Albino et ai.J. Exp. Med 164:\1\Q, 1986). Bryan high titer (e.i^.. ATCC Nos. VR- 
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334, VR.657. VR.726, VR-659, and VR.728), Bryan standard (ATCC No. VR-140), 
Carr-Zilber (Adgighitov et al.. Neoplasma 27:159, 1980), Engelbreth-Holm (Laurent cf 
ai. Biochem Biophys Acta 908:241, 1987), Harns, Prague (e.g.. ATCC Nos. VR-772, 
and 45033), and Schmidl-Ruppin {eg,. ATCC Nos. VR-724, VR-725, VR-354) viruses. 

5 Any of the above retroviruses can be readily utilized in order to assemble 

or construct retroviral metastatic marker gene delivery vehicles given the disclosure 
provided herein and standard recombinant techniques {e.g.. Sambrook el aL, Molecular 
Cloning: A Laboratory ManuaL 2d ed.. Cold Spring Harbor Laboratory Press, 1989, 
and Kunklc, PNAS cV2:488, 1985) known in the art. Portions of retroviral Metastatic 

10 marker expression vectors can be derived from different retroviruses. For example, 
retrovector LTRs can be derived from a murine sarcoma virus, a tRNA binding site 
from a Rous sarcoma virus, a packaging signal from a murine leukemia virus, and an 
origin of second strand synthesis from an avian leukosis virus. These recombinant 
retroviral vectors can be used to generate transduction competent retroviral vector 

15 particles by introducing them into appropriate packaging cell lines (sec Serial No. 
07/800,921, filed November 29, 1991). Recombinant retroviruses can be produced 
which direct the site-specific integration of the recombinant retroviral genome into 
specific regions of the host cell DNA. Such site-specific integration can be mediated by 
a chimeric integrase incorporated into thp retroviral particle {see Serial No. 08/445.466 

20 filed May 22. 1995). It is preferable that the recombinant viral gene delivery vehicle is 
a replication-defective recombinant virus. 

Packaging cell lines suitable for use with the above-described retroviral 
gene delivery vehicles can be readily prepared {see Serial No. 08/240,030, filed May 9. 
1994; see also WO 92/05266) and used to create producer cell lines (also termed vector 

25 cell lines or "VCLs") for production of recombinant viral particles. In particularly 
preferred embodiments of the present invention, packaging cell lines are made from 
human {e.g., HT1080 cells) or mink parent cell lines, thereby allowing production of 
recombinant retroviral gene delivery vehicles which are capable of surviving 
inactivation in human serum. The construction of recombinant retroviral gene delivery 

30 vehicles is described in detail in WO 91/02805. These recombinant retroviral gene 
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de„veo, vehicles can be used ,o generate transduction competent retroviral particles by 
.ntroducng them into appropriate packaging cell lines Serial No. 07/S00 921, 
StmUarly. adenovtrus gene dehvet, vehicles can also be readily prepared and uulized 
g.ven the disclosure provided herein (.e. also Berkner. aor.cA™,... 5:616-627 1988 
3 and Rosenfeid Science 25.:431-434, ,991, WO 93/07283, WO 93/06223, and 

WO 93/07282). 

A metastatic marker gene deliver vehicJe can also be a recombinant 

adenoviral gene delivery vehicle. Such vehicles can r-..^-. 

i-ii venicies can be readily prepared and utilized 

gwen the disclosure provided herein Csee Berkner, Bio,ec>,„>,ues 5:6.6, 1988 and 
° ''91. WO 93/07283, WO 93/06223. and WO 

93/07282), Adeno-associated viral metastatic marker gene delivery vehicles can also be 
constructed and used to dehver metastatic marker amino acids or nucleottdes The use 
of adeno-associated vital gene delivery vehicles /„ v„™ is descr.bed in Chatterjee e, a, 
Science 1485-1488 (1992). Walsh Proc ,.1 Aca,. Sc, 89: 7257-7.6, 

(1992). Walsh „ y am. In.es,. 9,: 1440-H48 ,,994), Flotte « „/., J Biol Chen 
268: 3781-3790 (1993), Ponnazhagan e, al. J. E.,. Me,. ,79: 733-738 ,1994,. Miller e, 
at. Proc. Nan Acad. Sci. 91: 10183-10.87 (1994), Einerhand e,M.. Gene Ther 2- 336 
343 (1995). Luc e, al. E.p. Hema.ol 23: 126,-,267 (1995). and Zhou e, al Gene 
Therapy S: 223-229 ,1996). /„ v,„o use of Utese vehicles ,s described in Flotte e, al 

P''oc Nail Acad Sci. 90- ^06\J-\nf,M i\aQ-^ j„ ,. 

l"617(199^).andKapl,tt«a/., Nalure Gene,. S:\4i- 

m another embodiment of the invention, a metastatic marker gene 
dehve,^ vehicle is derived from a togavtrus. Prefetred togaviruses include alphav.ruses 
m pamcular those described in U.S. Serial No. 08/405,627. filed March ,5 , 995 WO 
95/07994. Alpha viruses, including Sindbis and ELVS viruses can be gene deliverv 
vehtcles for metastattc marker polynucleotides. Alpha vimses are described in WO 
94/2,792. WO 92/10578 and WO 95/07994. Several different alphavirus gene del.verv 
vehtcle systems can be constructed and used to dehver metastatic marker subgenomic 
polynucleotides to a cell according to the present invention. Representative examples 
of such systems include those described in U.S. Patents 5.091.309 and 5^17 879 
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Particularly preferred alphavirus gene delivery vehicles for use in the present invention 
include those which are described in WO 95/07994, and U.S. Serial No. 08/405,627. 

Preferably, the recombinant viral vehicle is a recombinant alphavirus 
viral vehicle based on a Sindbis virus. Sindbis constructs, as well as numerous similar 

5 constructs, can be readily prepared essentially as described in U.S. Serial No. 
08/198,450. Sindbis viral gene delivery vehicles typically comprise a 5' sequence 
capable of initiating Sindbis virus transcription, a nucleotide sequence encoding Sindbis 
non-structural proteins, a viral junction region inactivated so as to prevent subgenomic 
fragment transcription, and a Sindbis RNA polymerase recognition sequence. 

10 Optionally, the viral junction region can be modified so that subgenomic polynucleotide 
transcription is reduced, increased, or maintained. As will be appreciated by those in 
the art, corresponding regions from other alphaviruses can be used in place of those 
described above. 

The viral junction region of an alphavirus-derived gene delivery vehicle 
15 can comprise a first viral junction region which has been inactivated in order to prevent 
transcription of the subgenomic polynucleotide and a second viral junction region 
which has been modified such that subgenomic polynucleotide transcription is reduced. 
An alphavirus-derived vehicle can also include a 5' promoter capable of initiating 
svnthesis of viral RNA from cDNA and a 3' sequence which controls transcription 
20 termination. 

Other recombinant togaviral gene delivery vehicles which can be utilized 
in the present invention include those derived from Semliki Forest virus (ATCC VR-67; 
ATCC VR-1247), Middleberg virus (ATCC VR-370), Ross River virus (ATCC VR- 
373; ATCC VR-1246), Venezuelan equine encephalitis virus (ATCC VR923; ATCC 

25 VR-1250; ATCC VR-1249; ATCC VR-532), and those described in U.S. Patents 
5,091309 and 5,217.879 and in WO 92/10578. The Sindbis vehicles described above, 
as well as numerous similar constructs, can be readily prepared essentially as described 
in U.S. Serial No. 08/198,450. 

Other viral gene delivery vehicles suitable for use in the present 

30 invention include, for example, those derived from poliovirus (Evans et oL, Nature 
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5JP:385, 1989, and Sabin et ai, J. Biol. Standardization hi 15, 1973) (ATCC VR-58); 
rhinovirus (Arnold et al.,J. Cell. Biochem. L401, 1990) (ATCC VR-1 1 10); pox viruses, 
such as canary pox virus or vaccinia virus (Fisher-Hoch et al.,. PNAS ^6:31.7, 1989:. 
Flexnerer fl/.. Arm. N.Y. Acad. Sci. 569:86, 1989; Flexner et ai. Vaccine 8:17, 1990; 

5 U.S. 4,603,112 and U.S. 4,769,330; WO 89/01973) (ATCC VR-1 1 1; ATCC VR-2010); 
SV40 (Mulligan et al.. Nature 277.T08. 1979) (ATCC VR-305), (Madzak et al., J. Gen. 
Vir. 75:1533, 1992); influenza virus (Luytjes et al.. Cell 59:1 107, 1989; McMicheal et 
al.. The New England Journal of Medicine i09: 13, 1983; and Yap et al.. Nature 
27i:238, 1978) (ATCC VR-797); parvovirus such as adeno-associated virus (Samulski 

10 et al., J. Vir 63:3822, 1989, and Mendelson et al.. Virology 766:154, 1988) (ATCC 
VR-645); herpes simplex virus (Kit et al.. Adv. Exp. Med. Biol. 275:219, 1989) (ATCC 
VR-977; ATCC VR-260); Nature 277: 108, 1979); human immunodeficiency virus 
(EPO 386,882, Buchschacher et ai, J. Vir. 66:2731, 1992); measles virus (EPO 
440,219) (ATCC VR-24); A (ATCC VR-67; ATCC VR-1 247), Aura (ATCC VR-368), 

15 Bebaru virus (ATCC VR-600; ATCC VR- 1 240), Cabassou (ATCC VR-922), 
Chikungunya virus (ATCC VR-64; ATCC VR-1 241), Fort Morgan (ATCC VR-924), 
Getah virus (ATCC VR-369; ATCC VR-1243), Kyzylagach (ATCC VR-927), Mayaro 
(ATCC VR-66), Mucambo virus (ATCC VR-580; ATCC VR-1 244), Ndumu (ATCC 
VR-371). Pixuna virus (ATCC VR-372; ATCC VR-1 245). Tonate (ATCC VR-925). 

20 Triniti (ATCC VR-469), Una (ATCC VR-374), Whataroa (ATCC VR-926), Y-62-33 
(ATCC VR-375), OTMyong virus. Eastern encephalitis virus (ATCC VR-65; ATCC VR- 
1242), Western encephalitis virus (ATCC VR-70; ATCC VR-1 251; ATCC VR-622; 
ATCC VR-1 252), and coronavirus (Hamre et al., Proc. Soc. Exp. Biol. Med 727:190, 
1 966) (ATCC VR-740). 

25 A subgenomic metastatic marker polynucleotide of the invention can 

also be combined with a condensing agent to form a gene delivery vehicle. In a 
preferred embodiment, the condensing agent is a polycation. such as polylysine. 
polyarginine, polyornithine, protamine, spermine, spermidine, and putrescine. Many 
suitable methods for making such linkages are known in the art (see, for example, Serial 

30 No. 08/366.787, filed December 30, 1 994). 
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In an alternative embodiment, a metastatic marker subgenomic 
polynucleotide is associated with a liposome to form a gene delivery vehicle. 
Liposomes are small, lipid vesicles comprised.of an aqueous compartment enclosed, by 
a lipid bilayer, typically spherical or slightly elongated structures several hundred 

5 Angstroms in diameter. Under appropriate conditions, a liposome can fuse with the 
plasma membrane of a cell or with the membrane of an endocytic vesicle within a cell 
. which has intemalized the liposome, thereby releasing its contents into the cytoplasm. 
Prior to interaction with the surface of a cell, however, the liposome membrane acts as a 
relatively impermeable barrier which sequesters and protects its contents, for example, 

10 from degradative enzymes. Additionally, because a liposome is. a synthetic structure, 
specially designed liposomes can be produced which incorporate desirable features. 
See Stryer, Biochemistry, pp. 236-240, 1975 (W.H. Freeman, San Francisco, CA); 
Szoka et al. Biochim. Biophys. Acta 600:1^ 1980; Bayer et aL, Biochim. Biophys, Acta. 
550:464, 1979; Rivnay et al. Metk Enzymol 7^9:119, 1987; Wang et al, PNAS 84: 

15 7851, 1987, Plant et aL, AnaL Biochem. 776:420. 1989, and U.S. Patent 4,762,915. 
Liposomes can encapsulate a variety of nucleic acid molecules including DNA, RNA, 
plasmids, and expression constructs comprising metastatic marker subgenomic 
polynucleotides such those disclosed in the present invention. 

Liposomal preparations for use in the present invention include cationic 

20 (positively charged), anionic (negatively charged) and neutral preparations. Cationic 
liposomes have been shown to mediate intracellular delivery of plasmid DNA (Feigner 
et al, Proc. Natl Acad. Sci. USA <§4:74 13-74 16, 1987), mRNA (Malone et al, Proc. 
Natl Acad. Sci. USA 56:6077-6081, 1989), and purified transcription factors (Debs et 
al.,J. Biol. Chem. 265:10189-10192, 1990), in functional form. Cationic liposomes are 

25 readily available. For example, N[l-23-dioleyloxy)propyl]-N,N,N-triethylammonium 
(DOTMA) liposomes are available under the trademark Lipofectin. from GIBCO BRL, 
Grand Island. NY. See also Feigner et al, Proc, Natl Acad Sci, USA 91: 5148- 
5152.87, 1994. Other commercially available liposomes include Transfectace 
(DDAB/DOPE) and DOTAP/DOPE (Boerhinger). Other cationic liposomes can be 

30 prepared from readily available materials using techniques well known in the art. See. 
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e.g., Szoka et ai. Proc. Nafl Acad. Sci. USA 75:4194-4198, 1978; and WO 90/1 1092 
for descriptions of the synthesis of DOTAP (l,2-bis(oleoyloxy)-3- 
(trimethylanimonio)propane) liposonaes. 

Similarly, anionic and neutral liposomes are readily available; such as 
5 from Avanti Polar Lipids (Birmingham, AL), or can be easily prepared using readily 
available materials. Such materials include phosphatidyl choline, cholesterol, 
phosphatidyl ethanolamine, dioleoylphosphatidyl choline (DOPC), 
dioleoylphosphatidyl glycerol (DOPG), dioleoylphoshatidyl ethanolamine (DOPE), 
among others. These materials can also be mixed v^^ith the DOTMA and DOTAP 
10 starting materials in appropriate ratios. Methods for making liposomes using these 
materials are well known in the art. 

The liposomes can comprise multilammelar vesicles (MLVs), small 
unilamellar vesicles (SUVs), or large unilamellar vesicles (LUVs). The various 
liposome-nucleic acid complexes are prepared using methods known in the art. See, 
15 e.g., Slraubinger e/ ai, METHODS OF Immunology (1983), Vol. 101, pp. 512-527; 
Szoka el ai, Proc. Nat'J Acad. Sci. USA 57. 3410-3414, 1990; Papahadjopoulos et al, 
Biochim. Biophys. Acta 39-^:483, 1975; Wilson et ai. Cell 17:11, 1979; Deamer and 
Bangham, Biochim. Biophys. Acta 443:619, 1976; Ostro et ai. Biochem. Biophys. Res. 
Commun. 76:836 . 1977; Fraley et al. Proc. Nafl Acad Sci. USA 76:3348, 1979; Enoch 
20 and Strittmatter, Proc. Nat'l Acad. Sci. USA 76:145. 1979: Fraley et al.,J. Biol. Chem. 
255:10431. 1980; Szoka and Papahadjopoulos, Proc. Nat'l Acad. Sci. USA 75:145. 
1979; and Schaefer-Ridder et al, Science 275:166, 1982. 

In addition, lipoproteins can be included with a metastatic marker 
subgenomic polynucleotide for delivery to a cell. Examples of such lipoproteins 
25 include chylomicrons, HDL, IDL. LDL, and VLDL. Mutants, fragments, or fusions of 
these proteins can also be used. Modifications of naturally occurring lipoproteins can 
also be used, such as acetylated LDL. These lipoproteins can target the delivery of 
polynucleotides to cells expressing lipoprotein receptors. Preferably, if lipoproteins are 
included with a polynucleotide, no other targeting ligand is included in the composition. 
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In another embodiment, naked metastatic marker subgenomic 
polynucleotide molecules are used as gene delivery vehicles, as described in WO 
90/11092 and Patent 5,580.859. Such, gene delivery vehicles can be either, 
metastatic marker DNA or RNA and. in certain embodiments, are linked to killed 
adenovirus. Curie! et al. Hum, Gene, Ther. i: 147- 154. 1992. Other suitable vehicles 
include DNA-ligand (Wu et ah, J. Biol Chem. 26-/:16985-16987, 1989), lipid-DNA 
combinations (Feigner el al, Proc. Natl Acad, Sci, USA 5^:7413 7417, 1989), 
liposomes (Wang et aL, Proc. Natl Acad ScL 5-=/:785 1-7855, 1987) and 
microprojectiles (Williams et al, Proc, Natl Acad, Sci, 55:2726-2730, 1991). 

One can increase the efficiency of naked metastatic marker subgenomic 
polynucleotide uptake into cells by coating the polynucleotides onto biodegradable 
latex beads. This approach takes advantage of the observation that latex beads, when 
incubated with cells in culture, are efficiently transported and concentrated in the 
perinuclear region of the cells. The beads will then be transported into cells when 
injected into muscle. Metastatic marker subgenomic polynucleotide-coated latex beads 
v^ill be efficiently transported into cells after endocytosis is initiated by the latex beads 
and thus increase gene transfer and expression efficiency. This method can be 
improved further by treating the beads to increase their hydrophobicity, thereby 
facilitating the disruption of the endosome and release of metastatic marker 
subgenomic polynucleotides into the cytoplasm. 

The invention provides a method of detecting metastatic marker gene 
expression in a biological sample. Detection of metastatic marker gene expression is 
useful, for example, for identifying metastases or for determining metastatic potential in 
a tissue sample, preferably a tumor. Appropriate treatment regimens can then be 
designed for patients who are at risk for developing metastatic cancers in other organs 
of the body. 

The body sample can be. for example, a solid tissue or a fluid sample. 
Protein or nucleic acid expression products can be detected in the body sample. In one 
embodiment, the body sample is assayed for the presence of a metastatic marker 
protein. A metastatic marker protein comprises a sequence encoded by a nucleotide 
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sequence shown in SEQ ID NOS:l-85 or its complement and can be detected using the 
marker protein-specific antibodies of the present invention. The antibodies can be 
labeled^ for example, with a radioactive, fluorescent, biotinylated, or enzymatic lag and 
detected directly^ or can be detected using indirect inamunochemical methods^ using a 
labeled secondary antibody. The presence of the nietastatic marker proteins can be 
assayed, for example, in tissue sections by immunocytochemistry, or in lysates, using 
Western blotting, as is known in the art. 

In another embodiment, the body sample is assayed for the presence of 
marker protein mRNA. A sample can be contacted with a nucleic acid hybridization 
probe capable of hybridizing with the mRNA corresponding the selected polypeptide. 
Still further, the sample can be subjected to a Northern blotting technique to detect 
mRNA, indicating expression of the polypeptide. For those techniques in which mRNA 
is detected, the sample can be subjected to a nucleic acid amplification process whereby 
the mRNA molecule or a selected part thereof is amplified using appropriate nucleotide 
primers. Other RNA detection techniques can also be used, including, but not limited 
to, m 5/7 w hybridization. 

Marker protein-specific probes can be generated using the cDN A 
sequences disclosed in SEQ ID NOS:l-85. The probes are preferably at least 15 to 50 
nucleotides in length, although they can be at least 8, 10 J L 12, 20, 25, 30, 35, 40. 45, 
60, 75, or 100 or more nucleotides in length. The probes can be synthesized chemically 
or can be generated from longer polynucleotides using restriction enzymes. The probes 
can be labeled, for example, with a radioactive, biotinylated, or fluorescent tag. 

Optionally, the level of a particular metastatic marker expression product 
in a body sample can be quantitated. Quantitation can be accomplished, for example, 
by comparing the level of expression product detected in the body sample with the 
amounts of product present in a standard curve. A comparison can be made visually or 
using a technique such as densitometry, with or without computerized assistance. For 
use as controls, body samples can be isolated from other humans, other non-cancerous 
organs of the patient being tested, or non-metastatic' breast or colon cancer from the 
patient being tested. 
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Polynucleotides encoding metastatic marker-specific reagents of the 
invention, such as antibodies and nucleotide probes, can be supplied in a kit for 
detecting marker gene expression products in a biological sample.. The kit can also 
contain buffers or labeling components, as well as instructions for using the reagents to 
5 detect the marker expression products in the biological sample. 

If expression of a metastatic marker gene having a nucleotide sequence 
shown in SEQ ID N0S:2, 4, 9, 13, 14, 19, 26, 29, 39-41, 48, 55, 57, 60, 63, 64, 72, 73, 
82, or 83 is detected, the biological sample contains cancer cells which will likely 
metastasize to the lung. If expression of a metastatic marker gene having a nucleotide 

10 sequence shown in SEQ ID N0S:1, 5, 1 1 . 18, 20, 22, 24, 30, 33, 35, 36, 38, 45, 52, 58, 
65, 66, 70, 74, 76, or 80 is detected, the biological sample contains cancer cells which 
will likely metastasize to the bone and/or lung. On the other hand, if expression of a 
metastatic marker gene having a nucleotide sequence shown in SEQ ID N0S:3, 7, 8, 10, 
12, 15-17, 21, 23, 25, 28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61, 62; 67, 68, 75, 77- 

15 79, 81, 84, or 85 is detected, the biological sample contains cancer cells which will 
likely not metastasize. Detection of expression of a metastatic marker gene comprising 
the nucleotide siequence shown in SEQ: ID NO:56 also indicates that the biological 
sample contains cancer cells which will likely metastasize. This infonnation can be 
used, for example, to design treatment regimens. Treatment regiments can include 

20 altering expression of one or more metastatic marker genes, as desired. Metastatic 
marker gene expression can be altered for therapeutic purposes, as described below, or 
can be used to identify therapeutic agents. 

In one embodiment of the invention, expression of a metastatic marker 
gene whose expression is up-regulated in metastatic cancer is decreased using a 

25 ribozyme, an RN A molecule with catalytic activity. See, e.g., Cech, \9%1 .Science 23 6\ 
1532-1539; Cech. 1990, Ann. Rev. Biochem. 59:543-568; Cech, 1992, Curr. Opin. 
Struct, Biol. 2: 605-609; Couture and Stinchcomb, 1996, Trends Genet. 12: 510-515. 
Ribozymes can be used to inhibit gene function by cleaving an RNA sequence, as is 
known in the art (e.g., Haseloff e/ a/., U.S. 5,64 L673). 
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Coding sequences of metastatic marker genes can be used lo generate 
ribozymes which will specifically bind to mRNA transcribed from a metastatic marker 
gene. Methods of designing and constructing ribozymes which can cleave other RNA. 
molecules in trans m a highly sequence specific manner have been developed and 

5 described m the an (see Haseloff, J. ei al. (1988), Naiure ijV:585o91). For examole, 
the cleavage activity of ribozymes can be targeted to specific RNAs by engineering a 
discrete "hybridization" region into the ribozyme. The hybridization region contains a 
sequence complementary to the target R.NA and thus specifically hybridizes with the 
target (see, for example, Geriach, W. L. e/ al.. EP 321,201). Longer complementary 

10 sequences can be used to increase the affinity of the hybridization sequence for the 
target. The hybridizing and cleavage regions of the ribozyme can be integrally related; 
thus, upon hybridizing to the target RNA through the complementary regions, the 
catalytic region of the ribozyme can cleave the target. 

Ribozymes can be introduced into cells as part of a DNA construct, as is 

15 known in the an. The DNa\ construct can also include transcriptional regulatory 
elements, such as a promoter element, an enhancer or UAS element, pnd a 
transcriptional terminator signal, for controlling the transcription of the ribozyme in the 
cells. 

Mechanical methods, such as microinjection, liposome-mediated 
20 iransfection. electroporation, or calcium phosphate precipitation, can be used to 
introduce a ribozyme-containing DNA construct into cells whose division it is desired 
to decrease, as described above. Alternatively, if it is desired that a DNA construct be 
stably retained by the cells, the DNA construct can be supplied on a plasmid and 
maintained as a separate element or integrated into the genome of the cells, as is known 
25 in the an. 

As taught in Haseloff e/ al.. U.S. 5,641,673. ribozymes can be 
engineered so that their expression will occur in response lo factors which induce 
expression of metastatic marker genes. Ribozymes can also be engineered to provide an 
additional level of regulation, so that destruction of mRNA occurs only when both a 
30 ribozyme and a metastatic marker gene are expressed in the cells. 



wooomT30 



27 



PCT/US99/24222 ~ 



Expression of a metastatic marker gene can also be altered using an 
antisensc oligonucleotide sequence. The anlisense sequence is complementary to at 
least a portion of the coding sequence of a metastatic marker gene havmg a nucleotide 
sequence shown in SEQ ID NOS: 1-85. The complement of a nucleotide sequence 
shown in SEQ ID NOS: 1-85 is a contiguous sequence of nucleotides which form 
Watson-Crick basepairs with a contiguous nucleotide sequence shown in SEQ ID NOS: 
1-85. 

Preferably, the antisensc oligonucleotide sequence is at least six 
nucleotides in length, but can be at least about 8. 12. 15. 20. 25, 30. 35. 40. 45, or 50 
nucleotides long. Longer sequences can also be used. Antisensc oligonucleotide 
molecules can be provided in a DNA construct and introduced into cells whose division 
is to be decreased, as described above. 

Antisensc oligonucleotides can comprise deoxyribonuclcotides. 
ribonucleotides, or a combination of both. Oligonucleotides can be synthesized 
manually or by an automated synthesizer, by covalently linking the 5' end of one 
nucleotide with the 3' end of another nucleotide with non-phosphodiester 
intemuclcotide linkages such alkylphosphonaies, phosphorolhioates. 
phosphorodithioatcs. alkylphosphonothioates, alkylphosphonates. phosphoramidates. 
phosphate esters, carbamates, acetamidatc. carboxymethyl esters, carbonates, and 
phosphate tnesters. S.c Brown, m4. Metl, Mol. Biol. 20:1-8; Sonveaux. 1994. Mch. 
Mol. Biol. 2(5:1-72; Uhlmann el al., 1990. Chem. Rev. 90:543-583. 

Although preci-se complementarity is not required for successful duplex 
formation between an antisense molecule and the complementary coding sequence of a 
metastatic marker gene, antisense molecules with no more than one mismatch are 
i prefeired. One skilled in the art can easily use the calculated melting point of a 
metastatic marker gene anti sense-sense pair to determine the degree of ^ mismatching 
which will be tolerated between a particular antisense oligonucleotide and a particular 

coding sequence of the selected gene. 

Antisense oligonucleotides can be modified without affecting their 
0 ability to hybridize to a metastatic marker protein coding sequence. These 
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modifications can be internal or at one or both ends of the antisense molecule. For 
example, internucleoside phosphate linkages can be modified by adding cholesteryl or 
diamine moieties with varyina numbers of carbon residues between tlie amino groups 
and terminal ribose. Modified bases and/or sugars, such as arabinose instead of ribose, 
5 or a 3\ 5'-substituted oligonucleotide in which the 3' hydroxy! group or the 5' phosphate 
group are substituted, can also be employed in a modified antisense oligonucleotide. 
These modified oligonucleotides can be prepared by methods well known in the art. 
Agrawal et aL, 1992, Trends BiolechnoL 10:152-158: Uhlmann ef aL. 1990, Chem, 
Rev. 9r;:543-584; Uhlmann e/ aL, 1987, Tetrahedron. Lett. 27^:3539-3542. 

10 Antibodies of the invention which specifically bind to a metastatic 

marker protein can also be used to alter metastatic marker gene expression. By 
antibodies is meant antibodies and parts or derivatives thereof, such as single chain 
antibodies, that retain specific binding for the protein. Specific antibodies bind to 
metastatic marker proteins and prevent the proteins from functioning in the celL 

15 Polynucleotides encoding specific antibodies of the invention can be introduced into 
cells, as described above. 

Marker proteins of the present invention can be used to screen for drugs 
which have a therapeutic anti-metastatic effect. The effect of a lest compound on 
metastatic marker protein synthesis can also be used to identify test compounds which 

20 modulate metastasis. Test compounds which can be screened include any substances, 
whether natural products or synthetic, which can be administered to the subject- 
Libraries or mixtures of compounds can be tested. The compounds or substances can 
be those for which a pharmaceutical effect is previously known or unknown. 

A cell is contacted with a test compound. The cell can be any cell, such 

25 as a colon cancer cell, which ordinarily synthesizes the metastatic marker protein being 
measured. For example. Tables 1 and 2 provide appropriate cell types which can be 
used for screening assays. 

Synthesis of metastatic marker proteins can be measured by any means 
for measuring protein synthesis known in the art, such as incorporation of labeled amino 

30 acids into proteins and detection of labeled metastatic marker proteins in a 
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polyacrylamide gel. The amount of metastatic marker proteins can be detected, for 
example, using metastatic marker protein-specific antibodies of the invention in 
Western blots. The amount of the metastatic marker proteins synthesized m the 
presence or absence of a test compound can be determined by any means known m the 
art, such as comparison of the amount of metastatic marker protein synthesized with the 
amount of the metastatic marker proteins present in a standard curve. 

The effect of a test compound on metastatic marker protein synthesis can 
also be measured by Northern blot analysis, by measuring the amount of metastatic 
marker protein mRNA expression in response to the test compound using metastatic 
marker protein specific nucleotide probes of the invention, as is known in the art. 

Typically, biological sample is contacted v^ith a range of concentrations 
of the test compound, such as 1.0 nM, 5,0 nM, 10 nM, 50 nM, 100 nM, 500 nM, 1 mM, 
10 mM, 50 mM, and 100 mM. Preferably, the test compound increases or decreases 
expression of a metastatic marker protein by 60%. 75%. or 80%. More preferably, an 
increase or decrease of 85%. 90%, 95%, or 98% is achieved. 

The invention provides compositions for increasing or decreasing 
expression of metastatic marker protein. Therapeutic compositions for increasing 
meta.stalic marker gene expression are desirable for markers which are down-regulated 
in metastatic cells. These compositions comprise polynucleotides encoding all or at 
least a portion of a metastatic marker protein gene cxprcjsion product. Preferably, the 
therapeutic composition contains an expression construct comprising a promoter and a 
polynucleotide segment encoding at least a portion of the metastatic marker protein 
which is effective to increase or decrease metastatic potential. Portions of metastatic 
marker genes or proteins which are effective to decrease metastatic potential of a cell 
can be determined, for example, by introducing various portions of metastatic marker 
genes or polypeptides into metastatic cell lines, such as MDA-MB-23]. MDA-MB-435- 
Kml2C. or Kml2L4, and assaying the division rate of the cells or the ability of the cells 
to form metastases when implanted in v/vo, as is known in the art. Non-metastatic cell 
lines, such as MCF-7, can be used to assay the ability of a portion of a metastauc 
marker protein to increase expression of a metastatic marker gene. 
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Within the expression construct, the polynucleotide segment is located 
downstream from the promoter, and transcription of the polynucleotide segment 
initiates at the promoter, A more complete description of gene, trar>sfer, vectors^ 
especially retroviral vectors is contained in U.S. Serial No. 08/869309, which is 
incorporated herein by reference. 

Decreased metastatic marker gene expression is desired in conditions in 
which the marker gene is up-regulated in metastatic cancer. Therapeutic compositions 
for treating these disorders comprise a polynucleotide encoding a reagent which 
specifically binds to a metastatic marker protein expression product, as disclosed herein. 

Metastatic marker therapeutic compositions of the invention can 
comprise a pharmaceutical ly acceptable carrier. Pharmaceutically acceptable carriers 
are well known to those in the art. Such carriers include, but are not limited to. large, 
slowly metabolized macromolecules, such as proteins, polysaccharides, polylactic acids, 
polyglycolic acids, polymeric amino acids, amino acid copolymers, and inactive virus 
particles. Pharmaceutically acceptable salts can also be used in the composition, for 
example, mineral salts such as hydrochlorides, hydrobromides, phosphates, or sulfates, 
as well as the salts of organic acids such as acetates, proprionates, malonates, or 
benzoates. 

Therapeutic compositions can also contain liquids, such as water, saline, 
glycerol, and ethanol, as well as substances such as wetting agents, emulsifying agents, 
or pH buffering agents. Liposomes, such as those described in U.S. 5,422,120, WO 
95/13796, WO 91/14445, or EP 524,968 Bl, can also be used as a carrier for the 
therapeutic composition. 

Typically, a therapeutic metastatic marker composition is prepared as an 
injectable, either as a liquid solution or suspension; however, solid forms suitable for 
solution in. or suspension in, liquid vehicles prior to injection can also be prepared. A 
metastatic marker composition can also be formulated into an enteric coated tablet or 
gel capsule according to known methods in the art, such as those described in U.S. 
4,853,230, EP 225,189, AU 9,224,296, and AU 9.230,801. 
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Administration of the metastatic marker therapeutic agents of the 
invention can include local or systemic administration, including injection, oral 
administration, particle gun^ or catheterized administration, and topical administration. 
Various methods can be used to administer a therapeutic metastatic marker composition 
directly to a specific site in the body. 

For treatment of tumors, including metastatic lesions, for example, a 
therapeutic metastatic marker composition can be injected several times in several 
different locations within the body of tumor. Alternatively, arteries which serve a 
tumor can be identified, and a therapeutic composition injected into such an artery, in 
order to deliver the composition directly into the tumor. 

A tumor which has a necrotic center can be aspirated and the 
composition injected directly into the now empty center of the tumor. A therapeutic 
metastatic marker composition can be directly administered to the surface of a tumor, 
for example, by topical application of the composition. X-ray imaging can be used to 
assist in certain of the above deliver)' methods. Combination therapeutic agents, 
including a metastatic marker proteins or polypeptide or a metastatic marker 
subgenomic polynucleotide and other therapeutic agents, can be administered 
simultaneously or sequentially. 

Receptor-mediated targeted delivery can be used to deliver therapeutic 
compositions containing metastatic marker subgenomic polynucleotides, proteins, or 
reagents such as antibodies, ribozymes, or antisense oligonucleotides to specific tissues. 
Receptor-mediated delivery techniques are described in, for example, Findeis et al. 
(1993), Trends in BiotechnoL 77, 202-05; Chiou et al. (1994), GENE THERAPEUTICS: 

METHODS AND APPLICATIONS OF DIRECT GENE TRANSFER (J.A. Wolff, ed.); Wu & Wu 

(1988), J. Biol. Chem. 263. 621-24; Wu et al. (1994)../ BioL Chenr 269, 542-46; Zenke 
et al. (1990). Proc. Not'l Acad Sci, U.S.A. 87, 3655-59: Wu et aL (1991). J. BioL Chem. 
266,338-42. 

Alternatively, a metastatic marker therapeutic composition can be 
introduced into human cells ex vivo, and the cells then replaced into the human. Cells 
can be removed from a variety of locations including, for example, from a selected 
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tumor or from an affected organ. In addition, a therapeutic composition can be inserted 
into non-affected, for example, dermal fibroblasts or peripheral blood leukocytes. If 
desired, particular fractions of cells such as a T cell subset or stem cells^ can alsch be 
specifically removed from the Wood (see, for example. PGT WO 91/16116). The 

5 removed cells can then be contacted with a metastatic marker therapeutic composition 
utilizing any of the above-described techniques, followed by the return of the cells to 
the human, preferably to or within the vicinity of a tumor or other site to be treated. 
The methods described above can additionally comprise the steps of depleting 
fibroblasts or other non-contaminating tumor cells subsequent to removing tumor cells 

10 from a human, and/or the step of inactivating the cells, for example, by irradiation. 

Both the dose of a metastatic marker composition and the means of 
administration can be determined based on the specific qualities of the therapeutic 
composition, the condition, age, and weight of the patient, the progression of the 
disease, and other relevant factors. Preferably, a therapeutic composition of the 

15 invention increases or decreases expression of the metastatic marker genes by 50%, 
60%, 70%, or 80%>- Most preferably, expression of the metastatic marker genes is 
increased or decreased by 90%, 95%>, 99%, or 100%. The effectiveness of the 
mechanism chosen to alter expression of the metastatic marker genes can be assessed 
using methods well known in the art, such as hybridization of nucleotide probes to 

20 mRNA of the metastatic marker genes, quantitative RT-PCR, or detection of an the 
metastatic marker proteins using specific antibodies of the invention. 

If the composition contains the metastatic marker proteins, polypeptide, 
or antibody, effective dosages of the composition are in the range of about 5 |ig to about 
50 |ig/kg of patient body weight, about 50 yig to about 5 mg/kg, about 100 ^ig to about 

25 500 |ig/kg of patient body weight, and about 200 to about 250 [.ig/kg. 

Therapeutic compositions containing metastatic marker subgenomic 
polynucleotides can be administered in a range of about 100 ng to about 200 mg of 
DNA for local administration. Concentration ranges of about 500 ng to about 50 mg, 
about 1 ^g to about 2 mg, about 5 fig to about 500 |ig, and about 20 jig to about 1 00 jig 

30 of DNA can also be used during a gene therapy protocol. Factors such as method of 
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action and efficacy of transfornnation and expression are considerations that will affect 
the dosage required for ultimate efficacy of the metastatic marker subgenomic 
polynucleotides. Where greater expression is desired over a larger area of tissue, larger 
amounts of metastatic marker subgenomic polynucleotides or the same amounts 
5 readministered in a successive protocol of administrations, or several administrations to 
different adjacent or close tissue portions of, for example, a tumor site, can be required 
to effect a positive therapeutic outcome. In all cases, routine experimentation in clinical 
trials will determine specific ranges for optimal therapeutic effect. 

Expression of an endogenous metastatic marker gene in a cell can also be 
10 altered by introducing in frame with the endogenous metastatic marker gene a DNA 
construct comprising a metastatic marker protein targeting sequence, a regulatory 
sequence, an exon, and an unpaired splice donor site by homologous recombination, 
such that a homologously recombinant cell comprising the DNA construct is formed. 
The new transcription unit can be used to turn the metastatic marker gene on or off as 
15 desired. This method of affecting endogenous gene expression is taught in U.S. Patent 
No. 5,641,670, which is incorporated herein by reference. 

The targeting sequence is a segment of at least 10, 12, 15, 20, or 50 
contiguous nucleotides selected from the nucleotide sequences shown in SEQ ID 
NOS:l-85 or the complements thereof The transcription unit is located upstream of a 
20 coding sequence of the endogenous metastatic marker protein gene. The exogenous 
regulatory sequence directs transcription of the coding sequence of the metastatic 
marker genes. 

A metastatic marker subgenomic polynucleotide can also be delivered to 
subjects for the purpose of screening test compounds for those which are useful for 

25 enhancing transfer of metastatic marker subgenomic polynucleotides to the cell or for 
enhancing subsequent biological effects of metastatic marker subgenomic 
polynucleotides within the cell. Such biological effects include hybridization to 
complementary metastatic marker mRN A and inhibition of its translation, expression of 
a metastatic marker subgenomic polynucleotide to form metastatic marker mRNA 

30 and/or metastatic marker protein, and replication and integration of a metastatic marker 
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subgenomic polynucleotide. The subject can be a cell culture or an animal, preferably a 
mammaK more preferably a human. 

Test compounds which can be screened include any substances, whether 
natural products or synthetic^; which can be administered to the subject. Libraries or 

5 mixtures of compounds can be tested. The compounds or substances can be those for 
which a pharmaceutical effect is previously known or unknown. The compounds or 
substances can be delivered before, after, or concomitantly with a metastatic marker 
subgenomic polynucleotide. They can be administered separately or in admixture with 
a metastatic marker subgenomic polynucleotide. 

10 Integration of a delivered metastatic marker subgenomic polynucleotide 

can be monitored by any means known in the art. For example. Southern blotting of the 
delivered metastatic marker subgenomic polynucleotide can be performed. A change in 
the size of the fragments of a delivered polynucleotide indicates integration. 
Replication of a delivered polynucleotide can be monitored inter alia by detecting 

15 incorporation of labeled nucleotides combined with hybridization to a metastatic marker 
probe. Expression of metastatic marker subgenomic polynucleotide can be monitored 
by detecting production of metastatic marker mRNA which hybridizes to the delivered 
polynucleotide or by detecting metastatic marker protein. Metastatic marker protein can 
be detected immunologically. Thus, the delivery of metastatic marker subgenomic 

20 polynucleotides according to the present invention provides an excellent system for 
screening test compounds for their ability to enhance transfer of metastatic marker 
subgenomic polynucleotides to a celL by enhancing delivery, integration, hybridization, 
expression, replication or integration in a cell in viiro or in an animal, preferably a 
mammaK more preferably a human. 

25 The above disclosure generally describes the present invention. A more 

complete understanding can be obtained by reference to the following specific examples 
which are provided herein for purposes of illustration only, and are not intended to 
limit the scope of the invention. 
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EXAMPLE 1 
Differentially Expressed Genes 

This example demonstrates polynucleotides that are differentially 
5 expressed in human breast or colon cancer cell lines. 

Human cell lines used to identify differentially expressed 
polynucleotides are the human breast cancer cell lines MCF-7 (non-metastatic), MDA- 
MB-231 (metastatic to bone and/or lung), and MDA-MB-435 (metastatic to lung) 
(Brinkley and Cailleau, 1980, Cancer Res, -^0:3118), and the colon cancer cell lines 
10 Kml2C (low metastatic) and Kml2L4A (highly metastatic) (Morikawa et al, 1988, 
Cawcer -/S: 1943-1 948). 

RNA was prepared from each cell line and reverse transcribed to form 
cDNA. The cDNA was amplified using random primers. Amplification products were 
visualized on a sequencing gel, and cDNA cprresponding to mRNA which was 
15 differentially expressed in the cell lines was identified. 

Expression patterns and sequence identification numbers of novel 
metastatic marker polynucleotides are shown in Table 1 . 

Expression patterns and sequence identification numbers of metastatic 
marker polynucleotides which correspond to known genes are shown in Table 2, and the 
20 corresponding proteins are described below. 

Osteopontin (SEQ ID NO:64) (OPN or Sppl for secreted phosphoprotein 
1) is a secreted extracellular matrix protein, often expressed during wound healing, 
involved in osteoclastic differentiation and activation, as described in Heymann et al,^ 
1998, Cytokine 70:155-168. Osteopontin is found in bone and other epithelial cells, and 
25 has been shown to stimulate proliferation of a quiescent subpopulation of prostate 
epithelial cells (see Elgavish et aL, 1998, Prostate i5:83-94). 

Osteopontin is implicated during the development of diabetic 
nephropathy (Fischer et uL, 1998, Diabetes -^7:1512-1518); the process of cartilage-to- 
bone transition during rigid bone healing after bone fracture (Nakase et uL, 1998, Acta 
30 Histochem 7(90:287-295): wound healing by an interaction with the receptor integrin 
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alpha(v)beta 3 after focal stroke (Ellison ef aL. 1998. Sfrokc 29:1698-1706); integrin 
receptor binding and signaling during cell attachment and mechanical stimulation of 
osteoblasts (Carvalho et aL. 1998. J. Cell Biochcm 7(^.376-J9()); kidney imrphogenesis 
(Denda et al.. 1998, MoL BioL Cell 9:1425-1435): and as an interstilial chemoatlraclanl 

5 in renal inflammation (Rovin and Phan, 1998, Am. Kidney Dis, 5/ : 1065- 1084). Mice 
lacking the osteoponlin gene showed modulation in osteoclast differentiation from wild 
type mice (see Rittlinge/ aL^ 1998,7, Bone Miner Res. 75:1 101-1 1 1 1). 

Osteopontin is synthesized by monocytes and macrophages within injury- 
sites, and can promote leukocyte adhesion through the alpha 4betal integrin. as 

10 described in Bayless ef al., 1998, J. Cell Sci. /;7:1 165-1 174. Osteopontin is 
transcriptionally regulated by retinoic acid (see Manji ef al.. 1998, J. Cell Physiol. 
l76:\-9): preferentially expressed in high grade metastatic brain tumors compared to 
low grade brain tumors, and inducible by tissue plasminogen activator (tPA) in glioma 
cell lines (see Tucker el al., 1998, Anticancer Res. 75:807-8 1 2). Osteopontin is 

15 expressed in about 73% oi" primary gastric carcinoma tissues and correlated with the 
progression of human gastric carcinoma and lymphogenous metastasis (see Ue e( uL. 
1998, lnt,J. Cancer 79:127-132). 

Ni]2(SEQ ID NO:65) is described in Boyd a aL. 1994, Cell 79:341 -35 K 
Aderjovirus ElB 19 kDa protein protects against cell death induced by viral mfecuon 

20 and external stimuli, and can be functionally substituted with the Bcl-2 proioncogene. 
ElB 19 kDa interacting proteins (Mipl, Nip2, and Nip3) were discovered in yeast two- 
hybrid studies conducted to discern proteins that interact with 19 kDa protein, as 
described by Boyd ei ai, supra. Nip 1, 2, and 3 interact with discrete domains of ElB 
19 kDa, and similarly also interact with Bcl-2, in both cases promoting cell survival. 

25 Ca-depcndent protease (SEQ ID NO:66) is Ca'--dependent protease (also 

called calpain), activity of which is present in every vertebrate cell that has been 
examined. Ca' "-dependent protease activity is associated with cleavages thai alter 
regulation of various enzyme activities, with remodeling or disassembly of the cell 
cytoskeleton, and with cleavages of hormone receptors (see Goll e! al.. 1992, Bioessays 

30 /-/(8):549-556). Ca'"-dependent protease activity is regulated by binding of Ca"' to 
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specific sites on the calpain molecule, with binding to each site generating a specific 
response corelated with a specific activity (e.g.. proteolytic activity, calpastatin binding, 
etc.), as described in Goll e( ai Excessive activation of the Ca "-dependent protease 
calpain may play a role in the pathology of disorders including cerebral ischemia, 
cataract, myocardial ischemia, muscular dystrophy, and platelet aggregation. 
Therapeutic applications include selective Ca'--dependent protease inhibition, as 
described in Wang and Yuen, 1994, Trends PharmacoL Sci. 75(1 1):412-419. 

[ GF-R (insulin-like growth factor receptor) (SEQ ID NO:67) is a 
transmembrane tyrosine kinase linked to the ras-raf-MAPl<C(mitogen-activated protein 
kinase) cascade and required for the cell to progress through the cell cycle {Werner and 
Roith. 1997. Crii. Rev. Oncog <!^(l):71-92). IGF-R mediates mitogencsis, growth 
hormone action, cell survival and transformation to and maintenance of the malignant 
phenolype. IGF-R is a member of the growth factor receptor tyrosine kmase 
superfamily, exists as covalent cross-linked dimers where each monomer is composed 
of two subunits. and is bound by ligand in the extracellular domain (Mclnnes and 

Sykes, 1997, Biopolymers Vj{5):j39-366)- 

The domains of the IGF-R are described in Scpp-Lorcnzino, 1998, 

Breast Cancer Res Treat 77(3):235-253, including domains responsible for milogcnesis. 

transformation, and protection from apoptosis. IGF-R expression is increased in breast 
0 cancer cells derived from tumor tissue and cell lines, as described in Surmacz et uL, 

1998. Breast Cancer Res Treat 77(3):255-267, and increased IGF-R may increase 

tumor mass and/or aid tumor recurrence by promoting proliferation, cell survival, and 

cell-cell interactions. Human pancreatic cancers overexpress IGF-R and its ligand 

(Korc, 1998, Swrg Oncol Clin N Am 7(1): 25-41). and expression ofIGF-1 and IGF-R is 
5 determined to be a prognostic factor (refiecting the interaction between the neoplastic 

cells and their microenvironment) for lymphocytic infiltration in thryoid carcinomas 

(Fonseca et ai, 1997. Verh Dlsch Ges Pathol 81:82-96), 

ILGF-BP5 (SEQ ID NO:68) is insulin-like growth factor binding protein 

5, described in Allander et aL. 1994, ./. BioL Chem. 269:10891-10898. The gene and 
0 promoter for 1GF-BP5 are characterized by Allander el ai. 1994. J. Biol Chem. 
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269:10891-10898, and some general actions of IGF-BPs are described in Chan and 
Spencer, 1997, Endocrine 7:95-97. Potential impact of IGF-BPs on cancer cell growth 
is described in Oh, 1997, Endocrine 7:1 1 1-1 13, and Oh, 1 998, Breast Cancer Res Treat 
^7:283-293. 1GF-BP5 is expressed during brain development: 1GF-BP5 and IGF- 1 

5 mRNAs are synchronously coexpressed in principal neurons of sensory relay systems, 
including the olfactory bulb, medial and dorsal lateral geniculate bodies, and ventral 
tier, cochlear, lemniscal, and vestibular nuclei, and are transiently coexpressed in 
principal neurons of the anterodorsal nucleus, as described in Bondy and Lee, 1993, J. 
Neurosci 73(12):5092-5104. 1GF-BP5 is expressed by luminal or cumulus granulosa 

10 cells in virtually all follicles, and is highly abundant in stromal interstitial cells of the 
mature ovary (see Zhou and Bondy, 1993, Biol. Reprod ^5:467-482). 1GF-BP5 
induction is strongly stimulated during differentiation of skeletal myoblasts and is 
correlated with IGF-R activation as described in Rousse ei aL, 1998, Endocrinology 
719:1487-1493. IGF-BPS and other components of the IGF system are critical in 

15 postnatal brain development (see Lee et aL, 1996, J, Cereh Blood Flow Meiab 76:227- 
236). 

IGF-BPS stimulates bone cell proliferation by an IGF-independent 
mechanism involving IGF-BP5-specific cell surface binding sites, as described in 
Mohan et aL, 1995, J. Biol Chem 270:20424-20431. In connective tissue cell types, 

20 IGF-BPS has a lowered binding affinity to the extracellular matrix which allows IGF-I 
to better equilibrate with the receptors which in turn potentiates IGF-I action on 
fibroblasts and smooth muscle cells (Clemmons, Mol Cell Endocrinology J 40:19-24), 

Lactate dehydrogenase (SEQ ID NO:69) is a member of the LDH group 
of tetrameric enzymes with five isoforms composed of combinations of two subunits, 

25 LDH-A and LDH-B. Shim et al, 1997, Proc. Natl Acad. Sci. P-/:6658-6663, described 
the relationship between LDH-A and neoplasia. In particular, overexpression on LDH- 
A may contribute to altered metabolism that confers neoplastic growth advantage. The 
expression pattern of LDH in the present invention is consistent, in that LDH 
expression is higher in two metastatic breast cancer cell lines than in a non-metastatic 

30 breast cancer cell line (Table 2). High or increasing lactate dehydrogenase (LDH) levels 
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in tumor tissue and cells is associated with poor survival rate in small cell lung 
carcinoma (SCLC), as described in Ray et ai, 1998, Cancer Detect Prev 22:293-304, 
making it a useful prognostic indicator for SCLC as discussed in Stokkel et aL^ 199&, J. 
Cancer Res Clin Oncol 72^:215-219. 
5 Ufo TKR (SEQ ID NO:70) is described in Schulz et al, 1 993, Oncogene 

5:509-513. This protein has been reported as a marker in tumors, but has not previously 
been reported in breast cancer. According to the present invention, expression is found 
in the MDA-MB-231 breast cancer cell line, but not in the MSF-7 or MDA-MB-435 
cell lines. This gene and protein provide new markers for distinguishing breast cancer 
10 tissue of different types of metastatic potential. 

Initially isolated from primary human myeloid leukemia cells, the ufo 
oncogene (also called Axl or Ark) is a receptor tyrosine kinase (RTK). Its genomic 
structure is described in Schulz et al, supra,, and its differential expression is described 
in Challier et a/., 1996, Leukemia 10'JS\-1S1. The ufo protein is a member of a class 
15 of RTKs having two fibronectin type III domains and two imnnunoglobulin-like 
domains present in the extracellular portion, and is preferentially expressed in 
monocytes, stromal cells, and some CD34-positive progenitor ce:lls (Neubauer et aL, 
1997, Leuk Lymphoma 25:91-96). Ufo has an extracellular structure similar to neural 
cell adhesion molecules, and has direct or indirect binding sites for PLCgamma, GRB2, 
20 c-src, and Ick (Braunger et al, 1 997, Oncogene 7 4:26 1 9-263 1 }. 

eIF-2 (SEQ ID N0:71) is a translation initiation factor, and functions as 
a heterotrimeric GTP-binding protein involved in the recruitment of methionyl-tRN A to 
the 40 S ribosomal subunit (Gasper et aL, 1994, J. BioL Chem. 269:341 5o422). 
According to the present invention, higher expression is found in two metastatic breast 
25 cancer cell lines and not in cell line MCF-7. 

elF-2 is involved in introducing the initiator tRNA into the translation 
mechanism and performing the first step in the peptide chain elongation cycle. eIF-2 is 
associated with a 5 subunit molecule having GTP recycling function called eIF-2B 
(Kyrpides and Woese, 1998, Proc, Nat'l Acad. Sci. USA 95:3726-3730, and Kimball et 
30 oL, 1998, J- Biol. Chem. 273:12841-12845). 
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eIF-2 has subunits alpha and beta. eIF-2alpha is phosphorylated at Ser 
51 and then modulates the interaction of eIF-2 and eIF-2B, as described in Kimball ei 
aU 1998, Proiein Expr, Purij: / 2:4 15-4 19, Kimball el aL. 1998, J, Biol. Cheni, 
273:3039-3044, and Paviit 199H, Genes Dev. 72:514-526. h is reporled that by 

5 regulating translation initiation, control of cell growth and division in eukaryotic cells is 
achieved: for example, clotrimazole, a potent anti-proliferative agent in vitro and in 
vivo, depletes intracellular Ca'" stores, which activates PKR, resulting in the 
phosphorylation of eIF-2alpha, and the ultimate inhibition of protein synthesis and 
blockage of the cell cycle in Gl phase (Aktas ei aL. 1998, Proc. Naf'l Acad. Sci. USA 

10 95:8280-8285). Additionally, Kim et aL, 1998, MoL Med. -/:I79-190, show that nitric 
oxide (NO) suppresses protein synthesis in cell types including human ovarian tumor 
cells by stimulating phosphorylation of elF-2alpha. 

Glutaminvl cvclase (SEQ ID NO:72) is described by Song et oL. 1994, 
./- MoL Endocrinol. l3\ll-%6. and is expressed most highly in the most metastatic cell 

15 line MDA-MB-435, as compared to less metastatic line MDA-MB-231 and non- 
metastatic line MCF-7. Glutaminyl cyclase (also called glutamine cyclotransferase) 
converts glutaminyl-peptides (such as gonadotropin-releasing homione and thyrotropin- 
releasing homione) into pyroglutamyl-peptides, as described in Busby et al,. 1987, ./. 
BioL Chem. 262:8532-8536, Fischer and Spiess, 1987. Proc. Nat'l Acad. Sci. USA 

20 .V-^:3628-3632, and Pohl et uL^ 199L Proc. Nat'l Acad. Sci. fSrSM0059-l0063. Cloning 
and sequence analysis of glutaminyl cyclase derived from a human pituitary cDNA 
library is described in Song el aL. 1994, J. MoL EndocrinoL /i:77-86. Studies on the 
catalytic pathway of glutaminyl cyclase and its substrate specificity are described in 
Gololobov e( ciL, 1996, BioL Chem. Hoppe Seyler i 77:395-398. Assays for the 

25 presence of glutaminyl cyclase activity are described in Koger el aL, 1989. Meihod 
EnzymoL /6cV:358-365 and Houseknecht el oL. \ 998,, Biotechnique.s 24:346. 

gpl 30 (SEQ ID NO:73) is transmembrane protein glycoprotein 130. 
gpl30 is a signal transducing shared component of the receptor complexes for the 
interleukin-6 (lL-6)-type cytokines (Hirano et aL, 1997. Cytokine Grow'th Factor Rev. 

30 5:241-252), including lL-6, IL-II. leukemia inhibitor factor (LIF), oncostatin M 
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(OSM), ciliary neurotrophic factor and cardiotrophin-1 . The N-terminal of gpI30 is an 
extracellular immunoglobulin-like portion of the protein (Hammacher el oL. 1998, 
Biol, Chcm. 273:2210\'21701). Signal transduction including gpl30 occurs through 
the gplSO/Jak/STAT pathway 1 (Hemrich 1998. Biochem. J. JJ-/:297-3l4). The 
5 cytokines acting through the pathway that includes gpl30 (also called gpl30 cytokines) 
exhibit pleitropic biological activities including immune, hematopoietic, and neural 
effects (Nakashima and Taga, 1998, Semin Hemaioi ji:2 10-221, Thompson e/ ai, 
1998, Neurosciencc <S-/: 1247- 1255. Hirano, 1998, //?/. Rev. Immunol. /6;249-284, Marz 
ei aL, 1997, Eur. J. Neurosci, 9:2765-2773, and Betz and Mulier, 1998, Inf Immunol 

10 /f;:l 175-1 184). 

gpl30 cytokines arc reported to control survival and proliferation of 
myeloma cell lines and primary myeloma cells (Klein, 1998, Curr. Opin. HematoL 
J: 186-1 91). gpl30 is expressed in the majoniy of renal cell carcinomas and has an 
important role in the proliferation of some renal cell carcinoma cell lines (Costes et al., 

15 1997, Clin. Pathol. o7;:835-840). 

E-cadherin (SEQ ID NO:75) is a member of a family of glycoproteins 
responsible for calcium-dependent cell-cell adhesion and is implicated in maintaining 
cytoskeletal integrity. Epithelial cadherin (E-cadherin) mediated cell adhesion system 
in cancer cells is inactivated by multiple mechanisms corresponding lo the pathological 

20 features of the particular tumor type (Hirohashi. 1998, Am J Pathol / jj;333-339). In 
general the cadherin system mediates Ca^--dependent homophilic cell-celi adhesion. 
Transcriptional inactivation of E-cadherin expression occurs frequently in tumor 
progression, and thus inactivation or downregulation of E-cadherin plays a significant 
role in multistage carcinogenesis (Hirohashi, 1 998. Am J Pathol 7iJ:333-339). 

25 E-cadherin is characterized as a tumor suppressor of the metastatic 

phenotype, as described in MacGrogan and Bookstein. 1997, Semm Cancer Biol 8:\\- 
19, and cadherins are important determinants of tissue morphology including invasive 
carcinoma as described in van der Linden, 1996. Earlv Pregnancy 2:5-14, and Yap, 
1998, Cancer Invest. /^5:252-261. 
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Mechanisms of action of cadherins are discussed in Daniel and 
Reynolds, 1997, Bioessays 79:883-891. The structure and function of cell adhesion 
molecules including E-cadherin are described in Joseph-Silverstein and Silversiein^ 
1998, Cancer Invest. 7 6:1 76-1 82, Yap et aL, 1997. Anna. Rew Cell Dev. Biol i 3:1 19- 
5 146, and Uemura, 1998, Cell 95:1095-1098. Cell adhesion molecules including E- 
cadherin are potential targets for anti-cancer drugs and therapeutics to treat acute or 
chronic inflammatory disease as described in Buckley and Simmons, 1997, Mol Med 
Today 3:449-456, Moll and Moll, 1998, Virchows Arch .^32:487-504. 

According to the present invention, E-cadherin is expressed in non- 
10 metastatic breast cancer cell line MCF-7, and not in MDA-MB-231 and MDA-MB-435. 
The expression products are diagnostic markers indicating the metastatic potential of 
breast cancer tissue samples. 

Serpin (SEQ ID NO:76), serine protease inhibitors, are a family of 
protease inhibitors that inhibit chymotrypsin-like serine proteases (Whisstock et al, 
15 1998, Trends Biochem. Sci, 23:63-67) and that have the unique ability to regulate their 
activity by changing the conformation of their reactive-center loop; studies of serpin 
variants provide definition for the functional domains of serpins that control the folding 
and link serpins mutations to disease (see Stein and CarrelK 1995, Nat, Struct. Biol, 
2:96-113). Serine protease cleavage of proteins is essentia! to a wide variety of 
20 biological processes, and the cleavage is primarily regulated by the cleavage inliibitors, 
as described in Wright, 1996, Bioessays 75:453-464. Members of the serpin family 
include alpha 1-antitrypsin (AAT) (Carrell et al, 1996, Chest 770:2435-2475), alpha2- 
anti-plasmin (PAI-1 and PAI-2) (Andreasen et al. 1997, Int, J, Cancer 72:1-22), 
thrombin, urokinase plasminogen activator, and kallikrein (Turgeon and Houenou, 
25 1997, Brain Res Brain Res Rev 25:85-95). Some serpins also have other activities 
including neuronal differentiating and survival activities (Becerra, 1997, Adv. Exp, Med. 
Biol. ^25:332-237) and tumor suppression (Sager ef aL, 1997, Adv. Exp. Med. Biol 
425\11'%%), PAl-1 and PAI-2 are linked to cancer metastasis, as described in 
Andreasen et al. 1997, Int. J. Cancer 72:1-22. 
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pS2 (SEQ ID NO:77) was isolated from MCF7 human breast cancer 
cells, as described in Takahashi et ai, 1990, FEBS Letters 2(57:2 83-286. pS2 is 
estrogen-regulated. Speiser et aL^ 1991 ^ Anticancer Research 1 7:679-6&4^ reported that 
the pS2 status declined from well to poorly differentiated ovarian cancer. pS2 
5 expression also is associated with a good prognosis in breast cancer patients. According 
to the present invention, pS2 is expressed in MCF-7 cells, but not in two metastatic 
breast cancer cell lines 

pS2 (presenilin-2 or trefoil factor 1 (TFF 1)) is a trefoil polypeptide 
normally expressed in the mucosa . of the gastrointestinal tract, and found ectopically in 

10 gastrointestinal inflammatory disorders and various carcinomas (May and Westley, 
1997, J. Pathol ] 83:4-7, pS2 is expressed in breast cancers (Poulsom et al, 1997, J. 
PathoL 755:30-38). pS2 is a pleitropic factor involved in mucin polymerization, cell 
motility (Modlin and Poulsom, 1997, J. Clin. Gastroenterol 25(1):S94-S100), cell 
proliferation and/or differentiation, and possibly, in the nervous system (see Ribieras et 

15 al,, ]99^,Biochim,Biophys.Acta. ]378:¥6\'F77X_ 

LIV-1 (SEQ ID NO:78) is an estrogen-regulated protein reported in the 
MCF-7 cell line (Green et a/., GeneBank submission Accession No. U41060). 
According to the present invention, LIV-1 is expressed in MCF-7 cells, but not in two 
metastatic breast cancer cell lines. 

20 Leucine-isoleucine-valine -1 (LlV-1) and other members of the LIV 

family (LIV-2, 3, and 4) are binding proteins that represent a transport system for 
branched chain amino acids in E. coli as described in Yamamoto e/ aL, 1979, J. 
BacterioL 75S:24-32, and Yamamoto and Anraku, 1980, Bacterial 144:36-44, A 
human homologue to LIV-1 is both estrogen and growth factor inducible in MCF-7 

25 human breast cancer cell line (El-Tanani and Green, 1997, J. Steroid Biochem. Mol, 
Biol 60:269-276: El-Tanani and Green, 1996, Mol Cell Endocrinol 72-/:71-77; and El- 
Tanani and Green, 1996, Mol Cell Endocrinol 727:29-35). 

GTP-bindino protein (SEQ ID NO:79) is a member of the family of 
guanine nucleotide-binding regulatory proteins, G proteins. The protein is expressed in 

30 MCF-7 cells, but not in two metastatic breast cancer cell lines. 
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G proteins provide signaling mechanisms for the serpentine family of 
receptors as described in Dhanasekaran and Prasad, 1998, BioL Signals Recept 7:109- 
117. Studies indicate that the alpha as well as the beta gamma subumts of the GTP- 
binding proteins are involved in the regulation of several cellular responses, some of 

5 which responses are critical to the regulation of cell growth and differentiation 
(Dhanasekaran and Prasad, 1998, Biol Signals Recept 7:109-117). G protein coupled 
receptors regulate the mitogen activated protein kinase pathway as described in Russell 
and Hoeffler, 1996, J, Invest. Dermatol Symp Proc 7:119-122, and thus play a role in 
controlling cell growth. GTP binding proteins are also implicated in the regulation of 

10 intracellular transport as described in Ktistakis, 1 998, Bioessays 20:495-504. 

Chemokines induce various intracellular signaling pathways in natural 
killer cells by activating members of GTP binding proteins as described in Maghazachi 
and Al-Auokaty, 1998, FASEB J. 72:913-924. Heterotrimeric GTP binding proteins 
regulate distinct signaling pathways, some of which in turn regulate the activity of 

15 Na+/H-H exchanger proteins as described in Voyno-Yasenetskaya. 1998, Biol Signals 
Recept 7:mA24, 

Desmoplakin (SEQ ID NO:84) is a member of a family of proteins that 
serve as cell surface attachment sites for cytophasmic intermediate filaments. 

Vimentin (SEQ ID NO: 80) is a member of the intermediate filament 

20 gene family (Evans, 1998, Bioessays 20:79-86. Intermediate filaments are a major 
component of the cytoskeleton of higher eukaryotes. Vimentin gene knockout mice 
indicate degeneration of the cerebellar Purkinje cells (Galou et al, 1997, Biol Cell 
59:85-97). Vimentin is positive in immunohistochemical reactions of sarcomas and 
related lesions (Gaudin et al, \9n,AmJSurg Pathol 22:148-162), and of desmoplastic 

25 small round-cell tumors and their variants (Gerald et al, 1998, J. Clin. Oncol, 76:3028- 
3036). Vimentin is also expressed in neoplasms showing follicular dendritic cell 
differentiation as described in Perez-Ordonez and Rosai. 1998, Semin. Diagn. Pathol 
75:144-154, and in biphasic carcinomatous-sarcomatous malignant mixed muUerian 
tumors as described in Guarino et al. 1998, Tiwiori 5-/:391-397. 
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Cytochrome C Oxidase (CcO) (SEQ ID NO: 81) is the terminal enzyme 
of the respiratory chain of mitochondria and aerobic bacteria: it catalyzes electron 
transfer from cytochrome C to molecular oxygen, reducing the oxygen to water (Michel 

ei aL 1998. Annu Rev Biophys Biomol Siruct 27:329-356). Cytochrome C oxidase is a 
member of the superfamily of quinol and cytochrome C oxidase complexes that are 
related by a homologous subunit containing six positionally conserved histidines that 
ligate a low-spin heme and a heme -copper dioxygen activating and reduction center as 
described in Musser and Chan, 1998, J. MoL EvoL ^6:508-520. Cytochrome C and 
ubiquinol oxidases are membrane-bound redox-driven proton pumps which couple an 
electron current to a proton current across the membrane (see Karpefors et al, 1998, 
Biochim Biophys Acta 7J6j: 159-1 69). Analysis of mutant forms of cytochrome C 
oxidase is described in Mills and Ferguson-Miller, 1998, Biochim Biophys Acta i5J:46- 
52. Nitric oxide inhibits respiration at cytochrome C oxidase, as described in Torres et 
aL, 1 998. J, Bioenerg Biomemhr 30:63-69. 

Heat shock protein 90 (hsp90) (SEQ ID NO: 82) acts as a chaperone 
molecule in association with the glucocorticoid and progesterone nuclear receptors, and 
has A, B, and Z regions for facilitating -these interactions (Dao-Phan et aL, 1997, K4oI 
Endocrinol 77:962-972). Levels of hsp90 are reported elevated in active systemic lupus 
erythematosus (Stephanou e( aL^ 1997. Biochem J 327:103-106), Increased hsp90 
expression is implicated in regulation of forms of cell injury that lead to programmed 
cell death as described in Galea-Lauri et al.^ 1996, J. ImmunoJ. 757:4109-41 1 8. Hsp90 
is upregulated in regenerating fibers and diseased fibers of Duchenne muscular 
dystrophy (Bomman el al, 1996, Muscle Nerve 79:574-580), and is a candidate 
substrate for proteolysis during ionizing radiation-induced apoptosis of some breast 
cancer cells (Prasad et oL, 1998, 7/?/. .7. Oncol 73:757-764). Hsp90 is involved in 
dislocation of the mutant insulin receptors from the endoplasmic reticulum to the 
cytosol as described in Imamura et al.. 1998. J. Biol. Chem. 273:11 183-11188. and 
associates with and activates endothelial nitric oxide synthase as described in Garcia- 
Cardena el al. 1998. Nature 392:821-824. 
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Integrin alpha 6 (SEQ ID NO: 83) is in the family of integrins, 
heterodimeric, cation dependent cell membrane adhesion molecules that mediate cell- 
cell and cell-matrix interactions. Integrin alpha 6 is a compcment of the 
hemidesmosome complex (Jones et aL, 1998, Bioessays 20:488-494). Inlegrins 
5 maintain tissue integrity and regulate cell proliferation, growth, differentiation, and 
migration. (See Thomas el a/., 1997, Oral Oncol 33:381-388). In oral squamous cell 
carcinomas there is a variable loss or reduced expression of integrin alpha 6, as 
described in Thomas et al, 1997, Oral Oncol ii:38 1-388. Alpha 6 integrin also plays 
an active role in invasion of intestinal and diffuse-type cells of representative gastric 
10 carcinoma cell lines as described in Koike e/ al. 1997, J. Cancer. Res, Clin, Oncol. 
7231:310-316. 

Osteooenic protein- 1 (OP-1) (also called BMP-7) (SEQ ID NO: 85) is a 
morphogenetic factor (and a member of the bone morphogenetic protein (BMP) family 
of grov^h factors) and is highly expressed in kidney and involved in tissue repair and 

15 development (see Almanzar et al, 1998, J. Am, Soc. Nephrol. 9:1456-1463). OP-1 is 
also expressed in the developing nervous system and can induce dendritic growth in 
sympathetic neurons as described in Guo et aL^ 1998, Neurosci. Lett 245:131-134. OP- 
1 stimulates cartilage formation as described in Klein-Nulend et al, 1998, J. Biomed. 
Mater. Res, 40:614-620. 

20 OP-1 induces down-regulation of insulin-like growth factor binding 

proteins (particulariy IGFBP-5) thus affecting lGF-1 in the context of bone cell 
differentiation and mineralized bone nodule formation as described in Yeh et al., 1997, 
Endocrinology ]38:4]S]'4\90. OP-1 can be used as a bone graft substitute to promote 
spinal fusion and to aid in the incorporation of metal implants (Cook and Rueger. 1996, 

25 Clin. Orthop, 524:29-38). The three dimensional structure of OP-1 is reported in 
Griffith e/ a/.. 1996, Prac^'^/V^ca^3f&7 93:878-883. 

The protein encoded by SEQ ID NO:56 is a putative secreted protein and 
is highly expressed in fat tissue. 
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Tahip 1 Novel Differentially Expressed Metastatic Marker Polynucleotides 



TRANSCRIPT 
NUMBER 


SEQID 
NO: 


non- 

lIlCLa^luii^ 

breast 
IVICF-7 


breast 
cancer 
metastatic 
to bone 

anrl/nr liino 

MDA-MB- 

231 


breast 

cancer 

metastatic 

to lung 

MDA- 

MB-435 


low 

metastatic 
from 
colon 
KM12C 


high 

metastatic 
'rom colon 
KIVI12L4A 


901 


1 


- 


H- 


- 






907 


2 


- 


- 


+ 






9102b 


-> 
-) 


+ 


- 


- 






9114 


4 


- 


- 








9121a 


5 


- 




- 






9129 


6 


+ 


- 


+ 






91393 


7 




- 


- 






9143b 


8 




- 


- 






9157b 


9 


- 


- 








9166 


10 


+ 


- 


- 






9170b 


11 


- 


+ 


- 






9190a 


12 




- 


- 






9191 


13 


- 


- 


4- 






9216 


14 


- 


- 








9224c 


15 


+ 


- 


- 






9230b 


16 




- 


- 







924 


17 


+ 


- 


- 






9242a 


18 


- 




- 






9259a 


19 


- 


- 


+ 






9261 


20 


- 


4- 








9272 


21 


+ 










9293b 


2"' 




+ 








9304b 


23 


+ 










9307a 


24 




+ 








931 


25 


+ 










9313 


26 
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TRANSCRIPT 
NIJMBFR 

I ^ iVi U LilV 


SEQ ID 
NO: 


non- 

metastatic 

breast 

MCF-7 


breast 
cancer 
metastatic 
to bone 
and/or lung 
MDA-MB- 


breast 

cancer 

metastatic 

to lung 

MDA- 

MB-435 


low 

metastatic 
from 
colon 
KMI2C 


metastatic 
from colon 
KMPL4A 


yj 1 u 


97 












0"^ 1 RK 




-f 












70 






-j_ 








'in 














J I 


4- 










yjj) / 




_L 
















-4- 










1/1 

j4 


_L 












J!) 




_l_ 








Vj J I D 


30 




-1- 










J / 


} 












JO 












yj / JO 


jy 






■ 








A(\ 
4U 












V->oDC 


A 1 
4 1 




























Al 
4 J 


+ 










yjy^ 




-l- 












4S 

*T J 




4- 








044 h 














1 


47 












953 


48 












954a 


49 


+ 










968 


50 


4- 










971 


51 












983c 


52 




4- 








985 


53 


+ 










990 


54 
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TRANSCRIPT 
NUMBER 


SEQ ID 

NO: 


non- 

metastatic 

breast 

MCF-7 


breast 

cancer 

metastatic 

to bone 

and/or lung 

MDA-MB- 

231 


breast 

cancer 

metastatic 

to lung 

MDA- 

MB-435 


low 

metastatic 
from 
colon 
KMI2C 


high 

metastatic 
from colon 
K:M12L4A 


998 


55 


- 


- 








316 


56 




- 


- 




- 


126c 


57 


- 


- 


+ 






207-4 


58 




+ 


- 






265-3 


59 


+ 


- 


- 






29B 


60 






+ 






305B-25 


61 












326B-39 


62 


-t- . 










34B-11 


63 






+ 







+ indicates differential expression as identified in differential display 



- indicates absence in differential display 



For transcript number 316, reverse transcription PCR (RT-PCR) was 
5 used to detect expression in the breast cancer cell lines. 



Table 2. Differentially Expressed Metastatic Marker Polynucleotides 



TRANSCRIPT 
NUMBER 


protein 


SEQ 
ID 
NO: 


non- 

metastatic 

breast 

MCF-7 


breast 

cancer 

metastatic 

to bone 

and/or lung 

MDA-MB- 

231 


breast 
cancer . 
metastatic 
to lung 
MDA-MB- 
435 


902 


osteopontin 


64 






+ 


9112 


nip 


65 




+ 




9132 


Ca-dependent 
protease 


66 




+ 




9158 


IGF-R 


67 


+ 






9174 


1LGF-BP5 


68 
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TRANSCRIPT 
NUMBER 


protein 


SEQ 
ID 
NO: 


non- 

metastatic 
breast 

MCF-7 


breast 

cancer 

metastatic 

to bone 

and/or lung 

MDA-MB- 

231 


breast 
cancer 
metastatic 
to lung 
MDA-MB- . 
435 


9177 


lactate 

dehydrogenase 


69 




-f- 


-f 


9202 


ufo TKR 


70 


- 






9210 


elF2 


71 


- 


+ 


+ 


9212 


giuiaminyl 

C VL idbC 


72 


- 




+ 


92 I 3 




73 






+ 




Tr.Fh-ll 

I \J\ u-ii 


74 




+ 




9232 


c-Caanerin 


/ J 








9239 


scrpin 


76 




1- 


- 


9250 


secreted pS2 


77 


4- 


- 




9260 


LIV-! 


78 


+ 






93 1 5 


GTP-binding 
protein 


79 








9317 


vimentin 


80 


- 






938 


cytochrome C 


81 




ii 




9382 


l-lsp 90 


82 


- 






9394 


integrin a6 


83 








956 


desinoplakin 


84 








970 


osteogenic 
protein 


85 









+ indicates differential expression as identified in differential display 
- indicates absence in differential display 



Within the scope of ihe invention are variants of the proteins described 
5 above. A variant is a protein encoded by a polynucleotide wherein the global sequence 
identity of the DNA, as compared to the corresponding SEQ ID NO: herein, is at least 
65% as determined by the Smith- Waterman homology search algorithm as implemented 
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in MPSRCH program (Oxford Molecular) using an affine gap search with the following 
search parameters; gap open penally of 12, and gap extension penalty oi' 1 . The protein 
encoded by the DNA having the sequence identity described above wilt exhibit the 
percent activity described in the preceding paragraph. 
5 Also within the scope of the invention are fusion proteins comprising the 

proteins and variants disclosed herein. Proteins preferably used in fusion protein 
construction include beta-galactosidase. beta-glucuronidase, green fluorescent protein 
(OFF), autoiluorescent proteins including blue fluorescent protein (BKP). glutathione-S- 
transferase (GST), luciferase, horse radish peroxidase (HRP) and chloramphenicol 

10 acetyltransferase (CAT). Additionally, epitope tags are used in fusion protein 
constructions, including Histidine (His) tags, FLAG lags, influenza hemagglutinin (HA) 
tags, Myc tags, VSV-G lags, and thioredoxin (Trx) tags. Other fusion constructions can 
include maltose binding protein (MBP), S-tag. Lex A DNA binding domain (DBD) 
fusions, GAL4 DNA binding domam fusions, and Herpes simplex virus (HSV) BP16 

15 protein fusions. 

These fusions can be made by standard procedures in the art of 
molecular biology, and many are available as kits from, for example, Promega 
Corporation (Madison, Wl); Stratagene (La Jolla, CA); Clontech (Mountainview. CA): 
Santa Cruz Biotechnology (Santa Cruz. CA); MBL International Corporation (MIC, 

20 Watertowai, MA); and Quantum Biotechnologies (Montreal. Canada). 

The proteins of the invention, and variants as described herein, can also 
be used to detect protein interactions in vivo, using the yeast two-hybrid system, for 
example as described in U.S. Patent No. 5.674,739. 

In addition to the ribozyme and antisense constructs described above, the 

25 polynucleotides of the invention can be used for inhibiting transcription via triple helix 
formation as disclosed in U.S. Patent No. 5.674.739. 

Those skilled in the art will recognize, or be able to ascertain, using not 
more than routine experimentation, many equivalents to the specific embodiments of 
the invention described herein. Such specific embodiments and equivalents are 

30 intended to be encompassed by the following claims. 
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All patents, published patent applications, and publications cited herein 
are incorporated by reference as if set forth fully herein. 
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CLAIMS 



We claim: 

1. An isolated and purified human protein comprising an amino acid 
sequence which ,s at least 85% identical to an amino acid sequence encoded by a nucleotide 
sequence selected from the group consisting of SEQ ID NOS: 1 -63 or the complement thereof 

2. The isolated and purified human protein of claim 1 wherein the amino 
acid sequence is at least 95% identical. 

3. The isolated and purified human protein of claim 1 wherein the amino 
acid sequence is encoded by a sequence selected from the group consisting of SEQ ID 
NOS : I -63 or the complement thereof 

4. A fusion protein which comprises a first protein segment and a second 
protein segment fused to each other by means of a peptide bond, wherein the first protein 
segmem consists of at least six contiguous amino acids selected from an amino acid sequence 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NOS: 1-63 or 
the complement thereof 

5. A preparation of antibodies which specifically bind to a human protein 
which comprises an amino acid sequence encoded by a nucleotide sequence selected from the 
group consisting of SEQ ID NOS: 1 -63 or the complement thereof 

6. A method for detecting metastatic tumor cells in a tissue sample. 

comprising the step of 

measuring in said tissue sample an expression product of a gene which 

comprises a coding sequence selected from the group consisting of SEQ ID N0S:1, 2. 4. 5. 

9, 1 1. 13, 14. 18. 19. 20. 22, 24. 26, 29. 30, 33. 35. 36. 38-41. 45. 48. 52. 55. 57, 58, 60. 63- 
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66, 69-74, 76, 80, 82, and 83, wherein a tissue sample which expresses the product is 
categorized as containing metastatic tumor cells. 

7. The method of claim 6 wherein the expression product is protein. 

8. The method of claim 7 wherein the protein is measured using an 
antibody which specifically binds to the protein. 

9. A method for detecting metastatic tumor cells in a tissue sample, 
comprising the step of: 

measuring in a tissue sample an expression product of a gene which comprises 
a sequence selected from the group consisting of SEQ ID NOS:3, 7, 8, 10, 12, 15-17, 2 K 23, 
25, 28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61, 62, 67, 68, 75, 77-79, 81, 84, and 85, 
wherein a tissue sample which does not express the product is categorized as metastatic. 

10. The method of claim 9 wherein the expression product is protein. 

11. The method of claim 10 wherein the protein is measured using an 
antibody which specifically binds to the protein. 

12. A method for determining metastatic potential in a tissue sample, 
comprising the step of: 

measuring in a tissue sample an expression product of a gene which comprises 
a sequence selected from the group consisting of SEQ ID NOS:L 2, 4, 5, 9, 1 1, 13, 14, 18, 
19, 20, 22, 24, 26, 29, 30, 33, 35, 36, 38-41, 45, 48, 52, 55, 57, 58, 60, 63-66, 69-74, 76, 80, 
82, and 83, wherein a tissue sample which expresses the product is categorized as havine 
metastatic potential. 



13. The method of claim 12 wherein the expression product is protein. 
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14. The method of claim 13 wherein the protein is measured using an 
antibody which specifically binds to the protein. 

15. A method for determining metastatic potential in a tissue sample, 
comprising the step of: 

measuring in a tissue sample an expression product of a gene which comprises 
a sequence selected from the group consisting of SEQ ID NOS:3, 7, 8, 10, 12, 15-17, 21, 23, 
28, 31, 34, 37, 42-44, 46, 47, 49-51, 53, 59, 61. 62, 67, 68, 75, 77-79, 81, 84, and 85, wherein 
a tissue sample which does not express the product is categorized as having metastatic 
potential. 

16. The method of claim 1 5 wherein the expression product is protein. 

17. The method of claim 16 wherein the protein is measured using an 
antibody which specifically binds to the protein. 

1 8. A method of predicting the propensity for metastatic spread of a breast 
tumor preferentially to bone or lung, comprising the steps of: 

measuring in a breast tumor sample an expression product of a gene which 
comprises a sequence selected from the group consisting of SEQ ID NO: L 5, 1 1 , 1 8, 20, 22, 
24, 30, 33, 35, 36, 38, 45, 52, 58, 65, 66, 70, 74, 76, and 80, 

wherein a breast tumor sample which expresses the product is categorized as 
having a propensity to metastasize to bone or lung. 

19. A method of predicting propensity for metastatic spread of a breast 
tumor preferentially to lung, comprising the steps of: 

measuring in a breast tumor sample an expression product of a gene which 
comprises a sequence selected from the group consisting of SEQ ID N0S:2, 4, 9, 13, 14, 19, 
26, 29, 39-41, 48, 55, 57, 60, 63, 64, 72, 73, 82, and 83, 
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wherein a breast tumor sample which expresses the product is characterized as 
having a propensity to metastasize to lung. 

20. A method of predicting propensity for metastatic spread of a colon 
tumor, comprising the steps of: 

measuring in a colon tumor sample an expression product of a gene which 
comprises the nucleotide sequence shown in SEQ ID NO:56, 

wherein a colon tumor sample which expresses the product is characterized as 
having a low propensity to metastasize. 



PCT/US99/24222 

WO 00/22130 



SEQUENCE LISTING 

<110> Chiron Corporation 

<120> METASTATIC BREAST AND COLON CANCER 
REGULATED GENES 



<130> 200130.460 

<14 0> PCT 

<141> 1999-10-13 

<160> 85 



<170> FastSEQ for Windows Version 3.0 



<210> 1 

<211> 142 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . - (142) 
<223> n ^ A,T,C or G 



<400> 1 

cacccaagaa ctaagaaaca aagggagaat gtacttttgt agcttagata agcaatgaat 

cagtaaagga ctgatctact tgctccacca cccctccctt aataataaca r.ttactgtnn 
atttcctggg cctaagactc ta 



<210> 2 

<211> 331 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) - . . (331) 
<223> n = A,T,C or G 



<400> 2 

cgcgagcaga caacataatt tatttccaga aaacaacaga aatgaacatc atcatgaata 

catgaaatcg gctgtgatgt gtgaactgct aagggccaaa tgaacgtttg cagagcagtg 

ggcacaatgt ttacaatgta tgtgtatgtc actttcggta cctgtgaatg catggggacg 

tgctgaaccc gaaaaaaagt gcctttccat aaggactgca atanagaggg caatttaccc 

tggtggtaca cggaacctan attcactcct gccatgcctt gccaatagta anctgcaggg 

tggaacaaga aatcacttgc tctgggggga a 

<210> 3 

<211> 1112 

<212> DNA 

<213> Homo sapien 
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<220> 

<221> misc_feature 
<222> (1) . . . (1112) 
<223> n = A,T,C or G 



<400> 3 

ccnnrmnnrm ntncntnnnn ncnnnnccnn ngnnnnnctn gcccrmacng ctnnnccccn 60 

nntnctnntn gntnangnnc ngaanccgcn nnnngnnnnn acnatnntnn gncgnnnnnt 120 

tcgttnnnnc ntgnntccnc nnnnnctngt ncnnnnnggn ggngcgcncc nccnancctn 180 

cctcnntgnn ncnnnctnnt nnctnngctg ngtctcncng cncngngcnn nnnggggtct 240 

nccgtnctnc nnnnncnnng ttttangncn gnaanacgcc gcgncgagct tttagccatg 3 00 

ggggataacc gaaccaaacn tnacactctc agaggatcca cctntgggtg caagcgaaac 360 

tngancnatc tatactctcg anggtncaag gacattgntg agagaaatgg anncacagcc 420 

cacgttcatt gggtangaga ctccnattaa natttctgtc tccccngatg ggccctagac 480 

ccatgaatcc ctattangat cccntcagcg gccanacncn gtggctccnc ctgtaatccc 540 

ccacntcggg aggctgatga gggcgaatcc aaggtcagga aatntatata gacncctggc 600 

taaccggnga accccccctc taaaancaaa aaaaaanncc nncnngtntt tanagggngt 660 

tntttttcnt cgccncgccc gncncgnccg cttnctngct ccncctgnnc nnncntccct 720 

ncnncnntgn tcancccngc gnnncgcnnc ntnccttnnt gngtctggtc ncncttcnnc 780 

ctctcttncn ccnntgtccn tngctctcag ccnctgcccc nccctnnccn tnngtgnnnc 840 

cnccntnatg nccncncnan aggngcangc nntggcncgc tgnccnntgt ntgtcnctcn 900 

acgganantg nactcncnac tnngnnacgc natnnnanct ctgctctcag atgacagcan 960 

cggnntnnnc ngcctctanc nncngnnncn nagccnncga nnnaggnanc cgcgntcant 1020 

cnnntttcnc tctncnntng catntctgat ngccgtgnct ncctcnnttn ctcnagcncn 1080 

tnnccacctc tcgtttagnc nctnnncnna nn 1112 



<210> 4 

<211> 183 

<212> DNA 

<213> Homo sapien 



<400> 4 

aaaactatga attccatact tgaggtttcc cagccaattg ctcccttctg ctttagaagt 60 

gactaggtac tgagagtaca aacactccca ctttataatg aaggcgccat gtcacccctt 120 

cctttacagg tcctggggtc caggagaccc agaatgaagg tgtcagttgg gcatgaagtg 180 

tta 183 



<210> 5 

<211> 1092 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feature 
<222> (1) . . . (1092) 
<223> n A,T,C or G 



<400> 5 

ttncagacca agaagacttg atnagctgaa acccattgcn ctacttggaa ngtgatcngc 60 

aaaagctgcc tcagtcanac accggggata aatctggatt tgggttccgg cgtcaaggtg 120 

aanatnatac ctantaanga acnctgtaca ntgccncaag cangtganga ccncccacga 180 

gtttacatna atacaatnct gaaacnacnc aggctggttt tatatctaca tatttgactt 240 

accactatcn cantaaagtt tngcaccttt cnccgaacga aaanaacccc ccntnntgnn 300 

ttcttttnaa aanaccntng nnccncnttn ccgtcncncc ccnnatantn nncnnatccc 360 

cccctctncc nntccntnnn cgtaannggc gtngcttntg cngcntntgt cccgttttcc 420 
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tccgcttngt cntttntcta tatnggctnn tnttatnccn ngcccttcgt cncctnnngn 480 

ttcgtctgtn cntagtcctc ncnctngagc cccanttgnt acttcrmgct tcnnctccgc 540 

attccntctc cgcncnnanc ncnnntctca nannatgnnc nntnnctncn nccnatncnc 600 

cctnanagnt tcgnctagac cntcnacntt gtntcccgnn ctcttagngn tctgctncta 660 

gtgtntnnct catctcctct ncttctctct cctttgacnc ngnncnctcc atcatrmtct 720 

gnctttctca tcncnnnnng- cccctnctcn cnnagtntgn gtgcncnnnc ttnnnntcna 780 

nctngtcgcc tccgttttcn actinnnnccn nngcngnncg nnngctcttt ctntcnnUta 840 

gactnacctt ntctgnnnnn tcannctagc nctgtccntc tctnntctgc atcnttanac 900 

atcttnntcn cccnctcgca ncntnctntt nacnctcnca tacgttnccn nnctcagtcc 960 

gcagnnnngt tncntncngt cntctcgcgn ctcnnntcct ctctnnnacn cncctggtct 1020 

ncgnctcgct ccnncccatn cntncctcgt cgntcnnnnt cnnatacgtn tncangccnc 1080 

ntctctccnc tn 1092 



<210> 6 

<211> 504 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (504) 
<223> n = A,T,C or G 



<400> 6 

ctggagcggg atcatttana atactttaca gatatntgca ccaggtacat ntatntgcgt 60 

ccattggtag cacagctgag acctgtgtct cacatcagcr taggtgaagc ctactacaaa 120 

taatgccaag ggagaanagc cagtacacta tatggtttat actctttatc cctttattca 180 

tagcatgttt tttaaaaatg ttatattatg caacagatgt gaggcagcan ctaagctata 240 

cttaagaatt ttctctcacc ttccaaacca aagtgtcctg aacaagccag gagacttatt 300 

cttttgtgca ccctggtgca catctgactg ttgtcctanc canaaactct ctgaggccac 360 

tgaaagaaca gtggccctat cgatttcatt cctaggtctc aaaaatacna tgtngccttg 420 

taacataatt agggacagca cctctatttc acaattataa tctaaggtag gataagacga 480 

cacagcagca ataaacttac aagt 504 



<210> 7 
<211> 1132 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (1132) 
<223> n = A,T,C or G 



<400> 7 

gcgngccccc tngtngnncn ttntncncng ttttctgctn tntttatnng aggnctnggt 60 

nnttnntctt agggnnntng tncggtcnng ttnntgttnc gagcagaaag tgnatatttc 120 

atgcngccaa gcttntttat tgaaaantcc taattntatt gnccgtntag taacatgttt 180 

gttcnacaan gctaatttct nataaancaa aacacannnt tttcttataa gtngtataaa 240 

ttatttnatt tacagaaact tgtttcaaaa canatgnact anntatttct nctcttttaa 300 

atanccanac taattttcta tccctngaca tctgttcatg. ttctatncag cagccaacac 360 

aaagtccanc. tgagagctct tgattaangt gtncgnatta tctagctact tccnacgttt 420 

tnggngcnng aaatgncttt taanancctg gcctcaaaaa anaaaaanan ccccccgnnn 480 

aggggnnttc cntntanaaa aanggntcnc tcnnccngtn ngagactgtc tccctgnntn 54 0 

ngnnnntcgc tntnatcang ngccncnang ctcnccntcn ctnnngcatt ngatnnntan 600 
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cnnnctgaga tgngnntang ctgntncntn ngtgtcntan gtctcgacgt tgnntggntn 660 

tangnancgn cnntntnnnc nnattgncga gngnntaagt gtgctcttct cntnacntct 720 

ntcnnnancn tctnngatgt tnatacggcc gtgcttnctt atcnntgana ncgntctnan 780 

nanntncgna tgagnntnta ctgcncncnt gtgtcatctt tctctctant gtgtnctnna 84 0 

tmcrmgtnat tncgcnnnac tgntantnag tggtatnnag anntcgnncg cnngngccnn 900 

tttnnctgtn gnnacnagnt ntcanganat tnatncnntc tncgtgatag anagntnagt 9^60 

gnnggntctg actgatncgt gtcctagtnn cngtgacatc gnncgttann gtcngcactc 1020 

tagtanannt nagtnngang ntgtanatnn ntctcntgtt tcagtnnagn cccncgagcg 1080 

cntcanntnt nantgtctcn tctnngtcgt anncntgtcg agtngtnana nn 1132 



<210> 8 

<211> 736 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 

<222> (1) . . . (736) 

<223> n = A,T,C or G 



<400> 8 

ntgggcccga cgtcgcatgc tcccggncgn catggnnnnc tggtttggtc anatgtgaat 60 

aacgnagaan tgagaccacn ganaagaacc acantgtnan ggnnctr.gca cntgntanga 120 

antnagnaat gcctctttnc tgagggcntt nggnnntcat nnangggngt gnggnggntt 180 

ncacctgtaa taccaccact ttncnatgcc actgccngtg natcaccngn ngtaaggact 24 0 

tcaanaccag ccttatnaac ntgggnaaac cntntntcta ctaaaaatnc tnnaantatc 300 

tgngcnnngt ngngcgttct tntannnccn gctgnacnng angncngngn angntantcg 360 

cntgaacntg ncntgttana gtngcantga gcctaaatca cantgatgta ttnncatctg 420 

ggacgacacg ancngacgac tcncgtactn aaaaaaaaaa ncccnttnng ggggggtttt 480 

tnnnggtatt anntatantt ggagaanttt gggtcannng aatattntta catgaaaaat 54 0 

naggaataac tntatntgtg tacattgggt tnnaaanang acantantgg nnctaaactn 6 00 

ttnggggngg aggggnnatt agggnnttaa ttnggnnnct tnnaaanncn nntnnngtat 660 

nanaanantn tttnnanaag ngnantngnt ttaaancctn aangnttnnn tnctnttann 720 

ttnnaannnn anannn 736 



<2ia> 9 

<211> 690 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (690) 
<223> n = A,T,C or G 



<400> 9 

tnnncctggn tggtcactcc cttctgtcct gttagctcat ggtgtaagat gatgtcttgt 60 

cagtattact gttttgctaa gccgcttcat tcatgcctac acaatttttt tttaaaaggg 120 

aactttagtt aattaagtga taagggactt aaatatgaat tanaatggtg cagaaagaga 180 

taccttttct ggatatttta aagtttaaag gtcantttct cttaatctga ttatgtgcac 240 

atatgaaaat ggcacatcat atacatgtaa aatcaggcag tatncattta ttaattactg 300 

tatttgacaa aggaaactct taaattataa tgtgaaacct ggttttatga aaccaatgac 360 

tagtgcanca tttcagcata tgcaaaaaaa aaanncctnt tggngngctg tttacaaagg 420 

aaattgttgg atttcacgat ggtttcagga naanaaggtt ttcntcatcn agggtaaacn 480 

tcccggataa ggcntngntt taatntnntt annccnnccn atngntaann gtggaaatta 54 0 
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ancctctgaa naaaanancc cacntnnttn gccttgggct tnantctntt tggcngnanc 
naaaggnnct tnccaggtnt cntgnngggc cngnngaann ataannaann nggggnnctt 
nggaaacctt ncnnnaanan tncccncccc 



600 
660 
690 



<210> 10 

<211> 395 ■ 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> Tnisc_f eature 
<222> (1) . . - (395) 
<223> n = A,T,C or G 

<400> 10 

tggtatctga cnnaataaga atgcacccat ttgtgagggg taatatttat ctcangattt 60 

actgtaaata tgtatacaca catacaaaaa cccaggcatt gttaagagaa aatnatggcc 120 

cagaggttna aattatcaga cagaaccttt aanaataatt atgattaatg tgttaaaatt 180 

ctagtggaaa agataaataa catgctcagg anattttagc anagagatag aaactatntn 240 

ngaagctcaa atgaaaatgc taggaaatga aaagcagtat tggaggtgaa agattccttt 300 

ggcaatttat caacanactg gagatggcan aggcataatc agtantattg aaggcagatt 360 

actatntatt atncaancaa aaaaaaaaac cccct 395 

<210> 11 

<211> 331 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (331) 
<223> n = A,T,C or G 

<400> 11 

aacgaggccn ngaggccaat gaggccaaca agacgatgcc ggagacccca actggggact 6 0 

cagacccgca acctgctcct aaaaaaatga aaacatctga gtcctcgacn atactagtgg 120 

ntcgctacag gagggaacgt gaaaagaaca tctccagagg aactggtgaa tgaccacgcc 180 

cgagagaaca gaatcaaccc cgaccaaatg gaggaggagg aattcataga aataacgact 24 0 

gaaagaccta aaaagtagca agaagctaca tccctcaaac ttcggcaatg aaaataaagt 300 

ttgagaagct caaaaaaaaa aanccctttt g 331 

<210> 12 

<211> 693 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1) . . . (693) 

<223> n = A,T,C or G 

<400> 12 

tncaacgcgt tgggagctnt nccaaggtgg nctagcnnca ttaatgccct accgtgggaa 60 

tatggntgaa gatcttgact aggggactta tgaacccatg cagccgtgcc caaatcctac 120 

caaactgacc ttactttctt gaagacggaa ttgtagtatg gtcgagctca tgctttttgt 180 
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agtaggccat ncaaattcga ttgactggct aaaaaagatt gttagtggag gctggaagaa 240 

acattttggc tgatgataga tgaatagagc ttggaacaat caaaaggaaa agcagaaagt 300 

ctatacctat tcataagaaa aagttagtat gtttaccgaa cattatnaaa gaattatgac 360 

attttcaaag ttttaaaatt ttattttgta gggacggggt ctcattgtgt agcccacnct 420 

ggtctgtttc ttgaggattt actatanact gggctgtatt caaagcattg gggatacagg 480 

catgaatgag cccccattgc ctgaactcac cattcaatct gggcagtgaa agaanaggga 540 

tgntgggaga nccttacaaa gatgaaatgt cgctaactgg agaaatccct actttcagtc 600 

agactgaann ggaacaggta gtnactgtgg gtagccctct ttgggnangg gtngattttc 660 

cacatgtgcc cagttaaggg ccnagaacat taa 693 

<210> 13 

<211> 305 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (305) 
<223> n = A,T,C or G 

<400> 13 

ttggtatcng gggatgggng aggggagata gncccgaagc atcccnnatt ctcagtaaac 60 

tccttggnat canannatat cntggccnaa gaaccncnca ccntctntgg gttagaaata 120 

ccgctntatn gngtatgagg ggatngggcn tacgnnataa tttnctatng ganggtattn 180 

ccgcactant gacnagttct ttctnnggtc catttnnaac nacantnttg acattgntga 24 0 

tctgcaannc tgtaaaatag tcttncagtg ggcaatnnnt gcacaactgg gttnggtntc 300 

anaca 305 

<210> 14 

<211> 308 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (308) 
<223> n = A,T,C or G 

<400> 14 

agcagacaac ntaatccaag ccatttacca aataantata tgcgatgcac attgaatcct 60 

ggcgctctag atatantgcc ccaaaggaaa gagnacaaag tnttccnccc ntagttctac 120 

natgnctatc cnctatcacc tnctgnttcn naagntttnt aaaaataaat tctcttgtat 180 

ancatccnat atcncaccgg tccaaagcgc aacaatctgc aattcanaan ttccaacaat 240 

cnatntatgn actttcntag gtccggtgtt ctaanatnta atatcctaac acttactctc 300 

agatctta 308 

<210> 15 

<211> 304 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (304) 
<223> n = A,T,C or G 
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<400> 15 

ngtnaaggga tatttattcc tgttttaaaa ggatacaacc aaggtaggga aggcttcgtt 60 

attggtgatt attcagaaga cctattttct ttacatatgc tatggaaaca atactgtttt 120 

ccgctacaga atacagttta tgattatact tttgtaaatt gcctgctttt cccctgtcat 180 

ctgctaattc caatttgata ctgttctgtg tccaaaaata cagcatgagc aagctgtaat 240 

ggtgcctgtc gagagtccca gctgcttggg gggctaaggt gggaggatca tttgagccca 300 



<210> 16 

<211> 703 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (703) 
<223> n = A,T,C or G 



<400> 16 

ccggtngnct aaaaaggacc agcctaatgt agaaggtggg tatttggacc agaggcttta 60 

gattattatt ttagatccta catatacttt tatcagtaga atgatttcat tnagatgtat 120 

aatgaaaaag ggtaatgcaa aaattatgta atagatacca aattagggaa gtttggcaat 180 

ttcaatggca tatttttagt caaggnacac agatggcagt gccataagca agtctataaa 240 

tatcggctgc agecatcccc ctcattttaa atgttgccct aataatcaat gcagttaaca 300 
agtatattgg ctgtgtgtca tgaaatagtt catgttcaga tggaaatgtt aggttactgt . 360 

atggtttatg gagattaatg aaaatgaatg cccaaaaaaa aaannccntt tngnggnggg 420 

tttnnnangn acngggctgg attcaaanca ttggggatnc angnttnaat gngnccccat 480 

ttgnctnaac ttaccttnna nnntgggcnn tnnatngaan angggatnnt gggannaacc 54 0 

tttnnangnt nnaantgtnn ncttactggn gnaaannncc ntaanntttn nnnntnnnnn 600 

ngnaangggg naannnnnnn ntnancttnt gggggagncn nttntggggn anggggggnt 660 

nnttnnnncn tnnnggccnn nnnnggggcn nnaaantttt tgn 703 



<210> 17 

<211> 171 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (171) 
<223> n = A,T,C or G 



<400> 17 

tccgcntcta agtaattcat caataacgca tgtccactta atgtgaaaat tggtaccatc 60 

taatanaatc ttcaacatgg cnatccacnc tattccaata atgaaatgca aatttccctg 120 

ccttctttac tanggtcatt tntagattct tgaggaatga gttctactct t 171 



<210> 18 

<211> 689 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc feature 
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<222> (1)...(689) 
<223> n = A;T,C or G 



<400> 18 

antnngcttn ggtactaagc agaatcactt ncttgggaac tccatgtaac tngtggqttL 60 

tgtgattgaa atagcatcag taaangtctg accctgtggt: aaagacacat atgngcgrtgg^ 120 

accnggctat gtctgacttt gtgctgctca ggacactccc tgtnaccaaa agngagagan 180 

cctggannac ctcanggggt canatgtttg aaggagctgc tgagtatcct ggcaggcanc 24 0 

anagccttac catcagtttg ctgcatggaa ggctgtgtgc ctctatttcc ctgctatttg 300 

ttgaactccc ttgagctccg gtccttccta agtgagagag atgatcccaa tagcnccaac 360 

ctgagagggc tggggagatg ttngaaggaa agcttggctg gggagctgaa tctggcctgt 420 

ggtacatgct tggtaactgg tggccaggan acccgggngt gtgtnctggg actgtcncac 480 

tctgctgacc agggtattga aagtccccnc tcaaanacac agaatntntc tgaccaaggg 54 0 

tangtatgan atgacntgtg gagcactttg nataaactgg ttctcatngg nggtcccctt 600 

gaanaggtgc tnnatctgtt caaaaatacg tggctgagct ntanacccng natcctctgt 660 

cagagacatg ggcaggggga ctcaatgct 689 



<210> 19 

<211> 721 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (721) 
<223> n = A,T,C or G 



<400> 19 

tatanatact nngctatgct ttctaccctg tgtgcctgga gacctactat ggaaaaanga 60 

tcagccacct taccttctac tgggtacctg ctgtgagtct gcctatgcca caacgattaa 120 

tgangggagg gtacccaagn gacaaanccn acatgccgct tacagccccc gttggatngn 180 

tgctcattca acagtcttgc attcagtagg tgtttgacat cacctactat gtgncaggct 240 

ctatgctang nactggggat acaggagaga ntnaagcgta aagtctttgg tctcaaggaa 300 

tttgcattct agaaagtcta agatgtaata aatgtactgt gggacatgtt aaataagtgc 360 

tataacgaaa tataaagggt ttgggagcaa aaaanaaacc cnnttgtggg gntctntncc 420 

nctctgatga agcttactta cttttaacct tnccttctcc tttaaaggtg tttcctggtt 480 

cccctttcct ttacagattg gttattggtc ttgctgagga gtaggactac aattnccagc 540 

attctnctgg aagccaaagc tgtgctacaa ttgnnccaaa gaagatngta atcttaagcg 600 

cccntaatgg taaaatngta ttaaaangtg gacctttgac aaataaattg nttcgatttc 660 

ngaattccgg gttngnagct tngngntncc aaaaaccctt nggggntccc ttttgggcac 72 0 



<210> 20 

<211> 248 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc__f eature 
<222> (1) . - . (248) 
<223> n - A,T,C or G 



<400> 20 

cttaaacacc ccncccatct ncnccccaga atgagntaan catactcntc nntactgnat 
ctccgtatcc gtccctacnc nggnttgtga ggtgtcatta gcngatatta ctcctcatcn 
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ncatcntgan cannatcccc catcnnccat atgntgatna nnacaaacca tnctattncg 
ccgnngaagc cnntcnnttc attggattcn tagaccgcan angtcctnat tcngacacng 
aatcggta 

<210> 21 

<211> 298 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_f eature 

<222> (1) . . . (298) 

<223> n = A,T,C or G 



<400> 21 

ggtctaaggg atgtgatgng agcatagaat ttanctntat ggncatanta gggacatntg 
ctgatntacn tggnctgcgg tcnntgaaag gtggngnatg atgactgatg tcatnagtag 
tacnanggac tncgnnanct gggatcnggg nttacnttgt tcatngtnag agtgnnancn 
aagtanatgn taggnataaa gatgttncgg gagatgggtc tacaaantct tttnaagatg 
ntcatcttga anannatcaa gtgtgnttgg tataatgact atcattatac aatgtcaa 



<210> 22 

<211> 591 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (591) 
<223> n - A,T,C or G 

<400> 22 

tcgctagant actattcggc cgcaacgggg agcctgatga ggacgcttat gatatgagga 6 0 

aagcactttc cagggatact gagaagaaat ccatcatacc attacctcat cctgtgaggc 120 

ctgaagacat tgaataaccc tgggcagtgg ttcttaggca gatactctag atgctttatg 180 

gacaatatta ttttcattgg atgattctgg agccctatta ggagaaaagt aatcatttta 240 

ggtcttaaag acttcaagaa aatacaggtt atcaatttat tttaaatctc attgtttcca 300 

gttagcaata tcatacctat taaagctgtt cattgtaaca aaattcaatc aaaaaggcag 360 
ctaggtcaga aggaaacata ccactctcat ggttcatagt attcactgta tgtatgctag _ 420 

ggaaaagact tgctccagtc tcctcctcag ttctgtgcct gagaaccact gctgcatata 480 

tttgttttta aattttgtat tgaactgtta attgaagctt taaaagcata tatgaaatgt 540 

ataaatctaa gatgtataat acattattga ctccaaaaaa aaaaacccct t 591 



<210> 23 

<211> 755 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) - . . (755) 
<223> n = A,T,C or G 

<400> 23 

gnnnnnngtt nnnnagcngg ttnggtncng actcccnttt atnatgaggg acactgaggc 
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ttcaagagat taggagactt gttcaaagac. acacagctgg taagtgatgg aggcaggatt 120 

taaacctggg tttcactgca tttcccatca ctggctttta gccatgatgc tctactgtgt I8O 

aaccctctta attcttgacc tgtggctata aagtatgtat tgagagacag gccctccctg 240 

agataacttt ccagccttga caaaggcaca cccttggttc attccttgga gtgtaggacc 300 

tagattgtga caagcccaga tgagtgtgtc tggcagaggg gagcagatct gaggccacca 360 

tatgtgttca cctagcccta aggagtgcca gcttcgctgg tatttgtaca gcttccatca 420 

ggactgctca ttggccacgt tctttcctct ccctgccacg ttgattaata ctcacataaa 480 

ttaatgctca cattagtgtt caagtatgca aatgagtgct taaaatcatc actcacacaa 540 

tgaccagact gaggatataa cacacaagag cccctctcct ggtaacccca caatcatgca 600 

gatgtgttga cttctctgca ttaccagtct ggtaggcagg gggatatgac agttagaaac 660 

agtctttcan acagcagttc tcaacaccag gtcccttgct gcacaatcga atcacctggg 720 

ggtttaaaaa aatatcatgc cagtcagcca cnntt 755 



<210> 24 

<211> 513 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> {!)... (513) 
<223> n = A,T,C or G 



<400> 24 

ctttctaccc aacaagcata gaatatacat tgtatacatc agaaacacgg gacattctcc 60 

aaaatagacc at*atgatagg gcacaaaaca agtctcagta aatttaagaa aatcagaatt 120 

atatcaagta ctctctcaga ccacagtgga ataaaattgg aaattaattc cgaaaggaac 180 

actcaaaagc atgcaaatac atggtaatta aataacctac tcctgaatga ttgttgggtc 240 

nacaatgata tcaagaggga aatttaaaaa ttctttgaac tgaacgataa tagtgacaca 300 

gcctatcaaa aactctggga tacagcaaaa gtggaggtaa gaagaaaatt catagcatta 360 

aatgcctata tcaaaaatct gaaagagcac aaataaacaa tctaaggtca ccctcncaga 420 

cittggagaaa ctagaacagt ccaaatccaa acccngcaga agaaaagaaa taaccaaatc 480 

cgaacaaaac taaatgaatt gaaaaaaatc ccc 513 



<210> 25 

<211> 574 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (574) 
<223> n - A,T,C or G 



<400> 25 

cgatccaaga gattagaanc ccntggagtg gagcatgctt cnctanaatn ccacctgatn 60 

cttggctnaa nacantnngc tctantttgc tttgtgcccg tccacacaan ctaaaaacaa 120 

gggatggggg gaccncnagt gtctaatatn cntaatatcc ntccncnggc aaatgaatac 180 

tttttacaca cttgtanntt ntggagggan ggggtnatna tgaggggaan gggaaaggat 240 

gaggagaaat ccaggatnan angtctcttc gtcctctcna gactncctca cactctntgt 300 

ggtnaccngg gttcgttntg tccaatggca gacattatac tccatantct acccnggctt 360 

nntcgggttg ggacgccann actcccccna gtngtnnccc ccnancagcn atacacaagt 420 

ntgaacgggt tttgtggcca ntcatcgcaa tgaccttntc ctcnactcna agaaaantaa 480 

accccttccc ccngattggt ttctaaatct ttcaccccat ctaaaataga aagcnctnag 540 

t9993ngggt tnatcccccc nttaccntta aaac 574 
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<210> 26 

<211> 185 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feature 
<222> (1) . . . (185) 
<223> n = A,T,C or G 

<400> 26 

gnacnattgg caatgacnga aagaatttga angatgnaca agtnaaagnn acagtggcaa 60 
agaatcttcn gggcgcgtca aaacaattgg gtgnattaag gacaanctcg gtcancagta 120 
taanctctct ttcncgngga ttantngnca taatcatnat tctgacnngt aggacattnc 18 0 

caacc 

<210> 27 

<211> 270 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (270) 
<223> n = A,T, C or G 

<400> 27 

ttctggggct ctatacaggc tcctatttng atccangcgt gctgatgagt gcacagcacg 60 

atcacatctg gaaaccacca ntaccaccac cactacgcac ntcaccaaaa ctgtganagg 120 

gggcatttca gagacaanaa ttgaaaancg aatagtcntc acgggggnat gcanacattg I8O 

accatgacca ggcgctggct caggcagnta aagaggccan agatcaacac cctgacatgt 24 0 

cngtgaccag agtggtggtc cttacanaga 270 

<210> 28 

<211> 758 

<212> DNA ^ 

<213> Homo sapien 



<220> 

<221> misc_feature 

<222> (1)...(758) 

<223> n = A,T,C or G 



<400> 28 

tgctaggtan aaagttacct ctaagggaag ctctgcagaa gaaatcagtg aaatactctg 60 

aaagccgcaa ttacaatcaa gaggaaccta cttccctcct ggcaaagaaa cccaaggaag 120 

gcgagcggaa gatttacttg gcaattgaaa gtgccaatga actggctgtg cagaaagcaa leo 

aggcagaaat caccaggctc ataaaagaag agctgatccg gctgcaaaat tcataccaac 240 

caacaaataa aggaagatac aaagtcttat agacatccgg aaaaaagatt tttacctgtg 300 

ctggtctatg atgtatgtgg cagttgctgt ctgcagttta caatgtattg tnaatgaaga 360 

ttttttaaat tctatcttgc tgattttttt taaatataan aaactggtac ttggtaaaga 420 

aatctgtccg taattncccc ccaatcagtc caactatatt taaagccacc tgttttcnaa 480 

ttttgatntc ctttaatgtt nactccaata tccatatttt aaatgtcccg gataatatcc 540 

caaaggttta aaaaatggaa atntttgaac ttcnnttgaa nanaataaat tcccatcctt 600 
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tangggntnt ccccttnccc gttcttccaa gaaatgtgac cttccccaaa aaagntnatc 
cctanctttt tgnttccccc ctgantttct gancccggac antnacgggt ttaaaanttt 
ttaaattttc caanncaaaa aaccntntnn ttttttna 



<210> 29 

<211> 577 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (577) 
<223> n = A,T,C or G 



<400> 29 

ctgctaggta ntaanattat ggatccacat tgtnctgagg anacgaanat acttgctgct 60 

gatngaggtg aaaacgatat tgatccntct ggggttttac ggtgtgcact gggtgctgca 120 

cnnacttgtc aaggtttgnt acgtcctctg ggcatctgca aaaggccctg ctctctggag 180 

tgttgtatgt agtgtaccaa aanagtattt atacatccca ccaatcaaaa cacagctttn 240 

ttacctcatg cgaactcatn caaaccaata gaatntcaac atgttctgta ccttanagtg 300 

ctcacttact acctctgaac natactcacg ctgtnntttg tctcttnctt atctttttgc 360 

^^.^.^^g^33t taactctttg tttcccttca tcaaatgtaa tgtanatcgt gatctattaa 420 

aanaaaaatc anggttgcac ttgctacttt naanaaaccg antgtggaaa cattgggtct 480 

naattcacac aggatcngta naactgttgt ggatactgag aaacntttga atgttcctcc 54 0 

ccttattacc atcccgcaaa aaaacccctn tnntttt ^'^'^ 



<210> 30 

<211> 449 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> {!)... (449) 
<223> n - A,T,C or G 



<400> 30 

tttacccaat aanntatagg cgatagaatt gatacctggc gcaatagata tagtaccgca 
aggganagat gaaaaattat aacnaagcat aatatagcaa ggactaaccc ctatnccttn 
tgcataatga attaactaga aataactttg caaggagagc caaagctaan accnccgaaa 
ccagacgagc tacctangaa cagctaaaag agcacacccg tctatgtagc anaatagtgg 
gaagatttat aggtagaggc gacaaaccta ccgagcctgg tgatagctgg ttgtccaaga 
tagaatctta gttcaacttt aaatttgccc acanaaccct ataaatcccc ttgtaaattt 
aactgttagt ccaaagagga acagctcttt ggacactagg aaaaaacctt gtagagagag 
tcataaaaaa aanccctntn gggnnnngn 

<210> 31 

<211> 500 

<212> DNA 

<213> Homo sapien 



60 
120 
180 
240 
300 
360 
420 
449 



<220> 

<221> misc_feature 
<222> (1) . . . (500) 
<223> n = A,T,C or G 
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<400> 31 

tcntggaccc nggtccccnn gngancaaan aagaagggcn ngnttncatn gaaaancctg 6 0 

tgattntcgc cccggtncag gtgttnannt atggcccncn cncatctggt atacgccnaa 120 

acaatntant tttacaatnn gtnccccanc aaacaangtt cgtngnnttn actaggtagt 180 

taatcccncc ccatgttcaa ataaagggcc cgcgntncna ataaggaanc cnccccgant 24 0 

ggggtccccg aggccctctc cttcataaaa nncattcaac ttccctcccn ctannaaaga 3.00 

aattnttcna atttttnaaa cactccctgt ccanggggac tttnccccca ntanctgaaa 360 

aaatngcntg acgttcccct tcggcctaag ggcncaactt anttnncccc caanacccgn 420 

gggnnaggnn naaactcccc tngaagggaa cnactcgcnt aaaaanggaa taatcncccc 48 0 

cnaattattc cctncccggg 500 



<210> 32 

<211> 426 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (426) 
<223> n = A,T,C or G 



<400> 32 

gtctatgatc acatctgacg ctattcctat cccctccctc cccgggacct ttTiccccttc 60 

ctccctggga ccttttcccc ttcctgttta anaanccagg gccgcctgga ggaagctttg 120 

tcagatctag tggaatgtga cctccctgga atatgtgccc aggggtttgt ctaagcagtt 180 

tcaggctatg gcctttactc catctggtcc ccatccctct tatctctctc atgtgtggct 240 

gcacctggac gcttggacca tagctgtcac agccccctgg ggaggaaccc actccttggc 300 

catntcagcc tgtgcaatgc aaggctcttg tttgatctgt gtgctgacan aaagcccagc 360 

ttccttaaga acttttcatg tggaacactt tggttttgan aagaaaataa atcanaaacc 420 

attaaa 426 



<210> 33 

<211> 375 

<212> DNA 

<213> Homo sapien 

<220> 

<221>' misc__f eature 
<222> (1) . . . (375) 
<223> n ^ A,T,C or G 



<400> 33 

ngttgcacct attggccngc tggtctcgac tcctgacctc gttatctgcc tgcctcggcc 60 

tcctaaagtg ctgggattac aggagtgagc cacagtgcct ggcctgtcaa gacttctctt 120 

aagttaactt cctgagaagt gacgtctaaa agtatctttg ctggtgtgag aactccagtt 180 

tccaacacat attatttccc tcaactattt ggaatatttt agaattttaa ttccaaagga 240 

ttagtttgaa tacaagtatg ccacataact cagttttcgc catcttncat ttcttaacag 300 

tgtaaattaa aagctaataa tcataataat aaagtgcatt taattatctt cgaaaaaaaa 360 

aaancccttt tgggg 375 



<210> 34 

<211> 809 

<212> DNA 

<213> Homo sapien 
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<220> 

<221> misc__f eature 
<222> (1) . . . (809) 
<223> n - A,T,C or G 



<400> 34 

ttgcacatgc tggccaggat ggtctcgatc tcctgacctc gtgatctgcc cgcctcggcc 
tcccaaagtg ctggaactac aggtgtgagc caccacgcct ggcagctttg tgttcttttc 
tttctgtgat cttgccttag atcacacaga taaaacatga caggacctgg accttaacac 
agtttggctc tcaatcctgt tctcataacc acnactgcct tcatttatct gtgtcatcct 
cagacctgac acatagtagg tgctcagtca gtgttcacta agtaaatgat gaccaagaac 
tctttgactg ggtccaaggt gcttatccca atacttcgcc atggctacct ccctcattcc 
tcagctgact tgctctctct agcctggctg ctcctatttt atttcctaaa catggaccca 
tggcaataag tttaaancta acangttgat acggtaccca tccataattt aatnaattnt 
ggggctcatg caaccncaaa aaccagaacc caaaactacc tgtncncaaa caacaatcat 
tttnggtngg gatcccntnc tngcttggnc ccttttttta aaatgtccat tccccccgga 
cttraagaaa ttgaaggaat ncccggaaan tattgttanc gggccccctt nagngaaaaa 
ggtggcnctc cnnncggggg ccctccctgt ccctgaaatt tnaaaacccc cctcccnntt 
taaanccctt aatcccggnt aacancnaaa naaaattcta gggcccaaac ccannggttt 
ggttttaaaa aaccntntat ttttttnat 



<210> 35 

<211> 192 

<212> DNA 

<213> Homo sapien 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
809 



<220> 

<221> misc__f eature 
<222> (1) . . . (192) 
<223> n = A,T,C or G 



<400> 35 

caccttattg ggatacagca gtgaattaag ctattaaaat aagataatga ttgcttttat 
accttcagta gagaaaagtc tttgcatata aagtaatgtt taaaaaacat gtattgaaca 
cgacattgta tgaagcacaa taaagattct gaagccaaaa aaaaaaaccc caanggggnt 
nnttttnaaa aa 

<210> 36 

<211> 368 

<212> DNA 

<213> Homo sapien 



60 
120 
180 
192 



<220> 

<221> misc_feature 
<222> (1) . . . (368) 
<223> n = A,T,C or G 



ctgctagtac caantattat ttaagantac ttttcactac tcctaaataa tgacacagat 
acgtttgtct tacacatttc actttattgt caagttatta gtatgtttat tttcaaaagt 
tattttttgc aatttctttt tattattccg tactttttaa atttacttca ttatcacgtc 
ttcctttatt ctttttaaat agtttttgct tttgttattt tgttttccct tttttactct 
tggtttgtaa tacctctttc cttatttgct cctttctcat ttgatcccaa tgttaatcca 
actgttttcc acatctgatt cactaaaatt ttagcccaaa aaaaaaancc cntttngggg 



60 
120 
180 
240 
300 
360 
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gngntttt 

<210> 37 

<211> 219 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . - . (219) 
<223> n = A,T,C or G 



<400> 37 

ggccccattt cactctccat antggcnctt nctngaacag gcgtnctgga tnagtgcaca 60 

• tacnatccca tcnacntgca cctatancnc ttccactacg cacatcacca aanctgtgaa 120 

agggggcntn tcnttagaca cacaattgca gaatngacnn cncancccgg gggannctcn 180 

angttcaccn tgnagcaggn gctggctcan gctnttata 219 



<210> 38 

<211> 198 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . - (198) 
<223> n = A,T,C or G 



<400> 38 

tcgatacagg gncagatctg ggagccaggg cgttgctgat gagttgcaca gacgatcaca 6 0 

tctgaaacca ccagtaccac caccactacg cacatcacca aagcgctggc tcnggcaatt 120 

aangaggcca aagagcanca ccctgacatg tcngtgaccn ttgtantggt ccntaangac 180 

acngacatcg cctccaca 198 



<210> 39 

<211> 560 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_feature 
<222> (1) . . . (560) 
<223> n = A,T,C or G 



<400> 39 

tttnnatcng nacagctagt cctntaaant aatgacttca tagaaatggc attataattt 60 

ttaagttgat actctacagg tagctattga tataattagt tttaataaaa catgctgcaa 120 

ccatggtata caacaaaaat acatttcttt ggtgattgaa attaaggccg tatttacaat 180 

gacttaatat aagactgact tttatcctgc ttcataactt gtatggagaa ctcaccaaga 24 0 

aagaattcaa tactgtgaaa tatgcagcaa gaagattggt ctttacctag gctgtgtttc 300 

ctaagctctg agttttcagc accagtagat ttgtattaaa agaaaaaaaa atggggcctt 360 

agcttctggc tttt.aatttt gccagctaag gacataaaac aaaantaanc aancaaaanc 420 

aaatagccat ntgctatcag catcattatg taaaagaaaa tntattttag cccctaaaat 480 

taggaagaat gtaatctcag aataaaggtt gtcatttaag ttgaataaat atntagcttt 54 0 

cgaaaaaaaa aanccccttt 56 0 
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<210> 40 

<211> 421 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 

<222> (1) . . . (421) 

<223> n = A,T,C or G 



<400> 40 

atacagggca gcgtgttagg tgaccacacc aggagcctca gcctcggtcc ttctcagccg 60 

tcgggataag atccaggcat gncttttaaa tctcagaggt agcagtaaac ttttcantnt 120 

tgcngttagc aagtgtgtgt ttgccaataa anccccatta tactaatgtg cctanttaat 180 

gttcagggaa natctgcttc cactgtgtnc cnaggggtgn catgaactnt gtgagnagcc 24 0 

ccncnnctgg agggatgaat gctgngttaa ctacngctat cacggatngt gcgntgtgaa 300 

naatacatcn acatnaatnt tanntgctct gnaanttccc ttnttatntg tcaagtaact 360 

ntttgtaaaa ntnntnctcc caanttatta cngtgattac taatnnattn gtnccatgtt 420 

t 421 



<210> 41 

<211> 411 

<2i2> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (411) 
<223> n = A,T,C or G 



<400> 41 

aggtagaggt tgtgcatgtt gtccttttta tctgatctgt gattaaagea gtaatatttt 60 

aagatggact gggaaaaaca tcaactcctg aagttagaaa taagaatggt ttgtaaaatc 120 

cacagctata tcctgatgct ggatggtatt aatcttgtgt agtcttcaac tggttagtgt 180 

gaaatagttc tgccacctct gacgcaccac tgccaatgct gtacgtactg catttgcccc 240 

ttgagccagg tggatgttta ccgtgtgtta tataacttcc tggctccttc actgaacatg 300 

cctantccaa cattttttcc cagtggagtc ncatcctggg atccagtgta taaatcccaa 360 

ttatcatgtc ttgtgcataa attcttccca aaagggatct ntaatttttt g 411 



<210> 42 

<211> 408 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (408) 
<223> n = A,T,C or G 



<400> 42 

ggctcccctc cctaactctc taagtacttc ccttacccac tcagtgtggt gatggcacct 60 

ccccgaatct cctgacaaat gcgaacagga actcctattc atcaggagcc aacttgataa 120 

ctganaagat tcctctctca tttatcagcc tttgattatc tttttgtgtc tcttactatt 180 

tgcgcttagc gagaaaaata aagaggtttg aacaattaag aagtaacaaa gagctcatag 240 
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ttcacaaaga gcaantcaaa ggatgtctgg aatatttgaa catacaactg cctttggcat 300 
gaggtggcct acatacattc tcaggggcag gataggctgg nanagctgat caagctgccg 360 
ggaaagctga agcaaaggca gggttggntg gaaatcaaaa tntctctt 408 

<210> 43 

<211> 275 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . - (275) 
<223> n = A,T,C or G ■ 



<400> 43 

tccctaactc tctaagtact tcccttaccc actcagtgtg gtgatggcac ctccctgaat 60 

ctcctgacaa- atgcgaacag gaactcctat tcatcagagc caacttgata actgagaaga . 120 

ttcctctctc atttatcagc ctttgattat ctttttgtgt ctcttactat ttgcgcttag 180 

caagaaaaat aaagaggttt gaacaantaa gaagtancnn ggagctcnta gttcanaagn 24 0 

agcaagtcaa aggatgtctg gangatttga agggt 275 



<210> 44 

<211> 246 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (246) 
<223> n ^ A,T, C or G 



<400> 44 

tttggtccca agcacatttc acaaangaga atttacacct agcacagctg gtgccangan 60 

atntcctang gacatggcca cctgggtcca ctccagcgac agacccctga caagagcagg 120 

tctctggagg ctnantngca tggggcctan tntcntcaat cnaatgagcc ccnantgcta 180 

ctgcgccccg ggggctccca cggcctgggc nnctttcntg caactgnaaa aggatagngg 24 0 

tatttc 246 



<210> 45 

<211> 345 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (345) 
<223> n = A,T,C or G 



<400> 45 

tttggctccg tgggacgttg tantgtgcnc agacatttcc aagggaaatt ctaaacagtc 60 

accctnccct tttgcattcc cccaaatctt aagtgtatac ataaaaccct gggtacatat 120 

tgtngtggta atagaaggga attggnnaaa cngtacactt gttatatgga antnactgtg 180 

gccacctaca aaagacaagt taacaaactg tcntggaggc tgtngntgcc canccagggc 240 

cgctgcnttt tgacaacatt cccaccctgg ccactcagca canttcatgg caggtcatgt 300 

ctntncactg anacntttnt ganacttttt catatagcan aatcc 345 
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<210> 46 

<211> 969 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 
<222> (1) . . . (969) 
<223> n - A,T,C or G 



<400> 
aattgcagtt 
gtccttccaa 
ggcaagaacc 
tatttagata 
ntagtatatc 
tacaaaaaaa 
ccntgtngcn 
cnggtctctg 
nctngtnnng 
cacanntcca 
tgt entente 
ggcttttatc 
gnntctcntn 
tggngegant 
nntgagannc 
nctcatatcc 
tgcccttnn 



46 

ctttcttgce 

ctctgtgatt 

antatgagat 

taattcccat 

cacaaagttg 

aaaaccccan 

tctagnncng 

ctcngngnnn 

tctgcnncgt 

nceanctgcc 

tcegtnecnc 

cccctntecn 

tnncnnnann 

cncgctctgn 

ccncccnctt 

nccatgetgn 



tttaacaaea 
etctgattcc 
ttgaggaaaa 
accatacaat 
tgctaccatc 
agtnanttec 
ngggntnnct 
cgntntgnct 
agccagtggn 
atnagtnana 
cngengctnn 
nctnttctng 
cgttnctntn 
gtatcagtea 
cgcatnaegt 
catganactc 



ttagggcctt 

atcctcattt 

atacttggat 

taaccttttt 

accactatcc 

tttcaaaacn 

tttgtcnntn 

tnnantcget 

cetcctgntn 

nnatetctnt 

ctcnttncge 

atggnntctc 

tnncttnccg 

tntanagann 

angtgncttt 

cntantctnn 



tagaatgagt 

tteaceatca 

tactettttt 

atgtgtataa 

gattccagag 

ctttnngttn 

tencectncn 

getnntcntg 

ccnncngntt 

tcnncanetg 

netggnngnc 

ntctcnacac 

ncngccatct 

ngngnntgtt 

ntnnatctgc 

cgcnhttctn 



acctggtgct 

ctggtgtaet 

taaaaaaaat 

ttcagtattt 

cttgtcatca 

ttcnttntnc 

ctcatcntnn 

tattccccgc 

ctntntncgg 

ntnneagnnt 

antcgtacct 

ctgncgttac 

nagctcannc 

nccnncgcgn 

tcgtcgtetc 

ncgttccctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

969 



<210> 47 

<211> 361 

<212> DNA 

<213> Homo sapien 

<220> 

<221> tnisc_f eature 

<222> (1) . . . (361) 

<223> n A,T,C or G 



<400> 
ggccaetaag 
gaccacaatg 
ntgtggaant 
tcaaaaactc 
tatcacaagc 
agatctcana 
t 



47 

caggtcttac 
gtatgaaaet 
taateaacnc 
tcttgaggtg 
ngtgctaagg 
tanaenaccc 



enaatttaag 
agaaatcagt 
atgagcaaet 
gatgagaatg 
gggaagttta 
agcnttncnc 



aanattgaan 
aacaggagga 
antgagtcna 
ganatacaac 
agtnetagat 
etcgaanaac 



tcctatcaag 
aaattggaag 
agancanatc 
ataccngaac 
gtctanatta 
tagaaaaact 



tatctcttct 
attcacaaat 
aaaagggann 
tcatgggatg 
ngaaagggaa 
aagaaaaaac 



60 
120 
180 
240 
300 
360 
361 



<210> 48 

<211> 364 

<212> DNA 

c213> Homo sapien 



<220> 
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<221> misc_feature 
<222> (1) . - . (364) 
<223> n = A,TX or G 



60 
120 
180 
240 
300 
360 
364 



<400> 48 

atgatgacca catntagatg gcacatngat gaggacttta atctttcctt aaanacaata: 
atgtgttctt ttctcttitta nticacatgat ttctaagtan, attttncatg caggacactt 
tttcaacctt gatgtacant gactgtgtaa aatttntctt tcagtggcaa cctctataat 
ctttannata tggtgagcat ccngtctgtt tagaanggga tatgacaata aatctatcag 
atggaaaatc ctgttacaaa gtataaaagc tttagtaatt tactcagtgt ggtggtttta 
tcctttttgc tttttctccc ttggtctata atgaaattgt tacagcagtg caaaataaaa 
ccct 

<210> 49 
<211> 703 

<212> DNA . 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . (703) 
<223> n = A,T,C or G 

<400> 49 

atggggaatc aaacaatgtt aaaaggctan taatacttac aggttttatg attcaatcta 

ctatgtgttt aaaattgttt tttgaaaaaa ttgagttatg tcnctaaaac tgagtctnta 

cagctcaaaa atgaagaaat acntatctcc gataagcata tuatgtgaat ttcaacatcn 

ccattgagaa aaggaatata aatttgaatg aaaatgaaac. tctatctttc tatatcacat 240 

tgcataggtg taggctagtg agtactttga tgtaaattgc tgtatctctt gaggcntcna 

tttggcnata tagatcagaa ttttaaatcn gcatactttg tttgccagaa atctatcagg 

accacttgta ntnattttgt tnaaaggaat atcnaacnct tggatgttca ncncagtatt 

gattgtttta naagaaggaa aiiggagaaag ggaggagaat ggaaganana aanggaggga 

ggaanattgg aaccnttgac atntgtgata gcatnggatt tgctnaacac nctatantat 54 0 

acccctngca tggganaagc atgcacnctn aaacaaggac nngttngatg gntctacnnt 

ttgacntcag atnnaantaa atnaaaaaaa aaancccccn cctctttgnn ttcctntcnn 
cgnnnnannc ntctccccnc nncgnccnnc ncccgccacc ntn 



60 
120 
180 



300 
360 
420 
480 



600 
660 
703 



<210> 50 

<211> 413 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc__f eature 
<222> (1) . . . (413) 
<223> n = A,T,C or G 



<400> 50 

tcttggctgg ttgagtattc aanaatcagg cacggagaag tggggtggac gcaaaccaac 
tgaccactgt ggcaccacca gcagtttcag ttttcatctt gantgtcnag aggaaatatc 
taatcttaca actcnttagg ggcctggctc agtggctcat accttgtntt cccancactt 
tgggangccg angcnggcnt atcacccgca ngtcaggatt ttgagaccac cctggccaac 
ntggtgaaac cccatctcta ctantcaata caaancttag ctangcgtga tggcatgcac 
ctctaatccc acttacttgg gangctgagg cagcganaat cacttgtaac ccggaaggca 360 
nacgttgcat ntgagccaag atcgtgccac tgcactccat cctgggcttt eta 413 



60 
120 
180 
240 
300 
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<210> 51 

<211> 252 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1) . . . (252) 

<223> n = A,T,C or G 



<400> 51 

gttacagaca aggnttntag aatatcttat gttttatgct ctgtaagttc aaagaagnta 60 

gcagaaaaca taagcatact gaaaagagaa acagaagcta ttttttaaat acctatgtga 120 

aatctctcta tntgaaacaa aaaatacact ggatggatta gacactgcag aaggaaaatt 180 

tggtgaactt gagatcttat aaataaaaat tatccaaaat gaagtgtaga gtgaaaaaaa 240 

aaaancccct at 252 



<210> 52 

<211> 875 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (875) 
<:223> n = A,T,C or G 



<400> 52 

agaaacgaga atgganattc aaatacgtcn gccgggcttg gtggattaga cctgtaaccc 50 

naacactttg ggaggnctag gcgggcggat caccngaggt cnngagtacg ggaacancct 120 

ggcaaaaacc ccntctttan tctgngaaaa cncaactcta ctaaaanaac tactcttaga 180 

tnggcgtngn tgcgcctgcc tgttntccca gatacnnttt naggctgang tggggataan 24 0 

tnctttaaca tgggaagtgg aagttgcact gatccaatgt ctccacactg cantccagcc 300 

tgggttangg aatgagaccc cncncacgga aaggacaata aaaanccccn nnggnhttnn 360 

tttttaangg cctcttgntc nttttcttnt antgcncgcc tncgcnnncn ttgntntgtc 420 

gantcnnntg cnnttntttc ttcnncctcn ancctgcttc tnntcnnttc gccnntnnac 480 

ngcttccccc ntnctctagc acttnnnttc tntcgntccn nnatctccnn cttntctnnn 540 

ccgctcgcgt nnnccntnan ctcgnntcnt nccctttctt cncngcnncn ntttcgncna 600 

gatcgtncgn ctctatctac ttctntccnn gntntanata tngatnttac attntgctcn 660 

atnacccatn annncntcta tgtttatann ngtnnnnccn ttcaacnnnn cnttatgagn 720 

tcttnactca gctctncgtt gntnttccna ctanngttgn ncntncatgt nctgtcncgt 73.0 

ancnctctnc tcntcncngt cntgagacna atctctatnt atngnttatn cctgcntnct 840 
ganctncacc gngatctcgg cnntntcttc tcaag . 875 



c210> 53 

<211> 182 

<212> DNA 

<213> Homo sapien 



<400> 53 

ccagaagaag ggctacatat ggactcatgt tgggcctact cctgcaataa caattaagga 60 

atcagttgcc aaccatttgt agttcacaaa ttaaaactgg gtttccaggc ctggtgtggt 120 

ggctcacgcc tgtagcccca gctattgcac cactgctctc caagctgggc aatggagtca 180 

ga 182 
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<210> 54 

<211> 329 

<212> DNA 

<213> Homo sapien 



<400> 54 

catgatgcga gactggacat ctctcctacc ccatgtacac ttcagctgag caggcagaat 60 

tagagagtca ggactagaag ttcagtctag ggatcaaata ataatagtag ctaatgttta 120 

aaggtaccta agatccgcca ggagacatac tcagtatagt tccgtggttt gccacatttc 180 

atcttatcca gtagcacagg tgaaatttgt cttatgtgta tactgaggaa aaacaagtcc 240 

ctctgatacc agcagccaat aaatgacaaa gctgggatag aaacttactt cattctaacc 300 

cgagagtccc tgttcttgca tggggcaca 329 



<210> 55 

<211> 312 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . - (312) 
<223> n = A,T,C or G 



<400> 55 

actcaactcg tttgagctat aggaatnggc cattcgnngt ggctcanacc tgtaatccca 60 

gnatttnggg anacctcact aggatcacnr gaggtcagga gttcaagacc agcctgtcca 120 

acatggngaa accccatctc tantanaaaa tacagaaatt atccaggtgt ggiiggctggc 180 

acctgtaatc ccagctactt gggaggccaa ggcatggaaa attgtctgaa cctgggaagt 240 

ggaggttgcg gtnanctgan atcatgccat tgctctccag cctcggccac anatcaagac 300 
cctatctcaa aa 



<210> 56 

<211> 565 

<212> DNA 

<213> Homo sapien 

<400> 56 

acaatttcac acaggaaaca gctatgacat gattacgaat ttaatacgac tcactatagg 60 

gaatttggcc ctcgaggcca agaattcggc acgaggggat ccaacgtcgc tccagctgct 120 

cttgacgact ccacagatac cccgaagcca tggcaagcaa gggcttgcag gacctgaagc 180 

aacaggtgga ggggaccgcc caggaagccg tgtcagcggc cggagcggca gctcagcaag 24 0 

tggtggacca ggccacagag gcggggcaga aagccatgga ccagctggcc aagaccaccc 300 

aggaaaccat cgacaagact gctaaccagg cctctgacac cttctctggg attgggaaaa 360 

aattcggcct cctgaaatga cagcagggag acttgggtcg gcctcctgaa atgacagcag 420 

ggagacttgg gtgacccccc ttccaggcgc catttagcac agcctggccc tgatctccgg 4 80 

gcagccacca cctcctcggt ctgccccctc attaaaattc acgttcccaa aaaaaaaaaa 54 0 

aaaaaaaaag atgcggccgc aagct 565 

<210> 57 

<211> 798 

<212> DNA 

<213> Homo sapien 



<400> 57 
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ggaacaagta gaagggaaga gggaaatgga gagcatcctt atgactttac aaagggtgga 60 

aatgaggatg gagggataca gaagtctgca cagctgtaaa ggttttatag atgtctttgc 120 

cttcccttct gaggaaggga agaagtaatg aaagcacatg tgaataaccc cttccatccc 160 

attcacagca tcgcactccc agtccttaag gcaaagggag gcagtgctga agcattggtg 24 0 

gtgcagtgta aagagacaag acctgatcat ctgatcacac ttgtgccaac ttgattcata 300 

ttgggcatta ctaacaaccc ctggtcaagg taaataggtt gaacaatcaa taacattatc 360 

cctgcctgca tacatgtgaa caaaagctat agaggacatg caaattctac agtcattcct 420 

catatgcttt agacagagtg cagctactgg aatcttccag atttcagtgt tttaaaatca 480 

gagctctgaa tacacaaaag gaaagagaaa tggagcagct gacatatttt aagctcacag 540 

tgatactcag tgacaggagc acagagctct aatgtccaca ggatgttgta gggtagggtc GOO 

cctcagtaaa tcaagtccct tacctatgtt ctgacactga ggctcttgga gctatgggtt 66.0 

agaaatccag gaggcaatat gtctttattc caatgaagtc ctcatcttgc actcagaggc 720 

ccactagttt gcccttctat atattaagta aaaccaagag aaattaaaaa aaaaaaagcc 780 

ctatagtgag tcgtatta 798 

<210> 58 

<211> 729 

<212> DNA 

<213> Homo sapien 

<400> 58 

aagaatagac cgagataggg ttgagtgttg ttccagtttg gaacaaqagt ccactattaa 60 

agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac 120 

gtgaaccatc accctaatca agtttcttgg ggtcgaggtg ccgtaaagca ctaaatcgga 180 

accctaaagg gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa 24 0 

aggaagggaa gaaagcgaaa ggagcgggcg ccacggcgct ggcaagtgta gcggtcacgc 300 

tgcgcqtaac caccacaccc gccgcgctta atgcgccgct acagggcgcg tccattcgcc 360 

attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc tattacgcca 420 

gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag ggttttccca 48 0 

gtcacgacgt tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt 54 0 

gggccctcta gatgcatgct cgagcggccg ccagtgtgat ggatatctgc agaattcggc 600 

ttgtaatacg actcactata gggctttttt ttttttcggt tcgaggggga atgctggaga 660 

ttgtaatggg tatggagaca tatcatataa gtaatgctag tcttatcctg tgtgaaattg 720 

ttatccgct '729 

<210> 59 

<211> 730 

<212> DNA 

<213> Homo sapien 

<400> 59 

aagaatagac cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa 60 

agaacgtgga ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac 120 

gtgaaccatc accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga 180 

accctaaagg gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa 24 0 

aggaagggaa gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc 3 00 

tgcgcgtaac caccacaccc gccgcgctta atgcgccgct acagggcgcg tccattcgcc 36 0 

attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc tattacgcca 420 

gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag ggttctccca 480 

gtcacgacgt tgtaaaacga cggccagtga attgtaatac gactcactat agggcgaatt 540 

gggccctcta gatgcatgct cgagcggccg ccagtgtgat ggatatctgc agaattcggc 600 

ttgtaatacg actcactata gggctttttt ttttttcggt ttgaggggga atgctggaga 660 

ttgtaatggg tatggagaca tatcatataa gtaatgctag tcttatcctg tgtgaaattg 720 

ttatccgcta 730 
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<210> 60 

<211> 623 

<212> DNA 

<213> Homo sapien 



<400> 60 

gactccaaga gaagactagg- aagtagccct cgttctccag. ggcacccaaa ataccagcct: 60 

ttattgtctg catgatttta ggggatatgg ggagggaaca agtagaaggg aagagggaaa 120 

tggagagcat ccttatgact ttacaaaggg tggaaatgag gatggaggga tacagaagtc 180 

tgcacagctg taaaggtttt atagatgtct ttgccttccc ttctgaggaa gggaagaagt 240 

aatgaaagca catgtgaata accccttcca tcccattcac agcatcgcac tcccagtcct 300 

taaggcaaag ggaggcagtg ctgaagcatt ggtggtgcag tgtaaagaga caagacctga 360 

tcatctgatc acacttgtgc caacttgatt catattgggc attactaaca acccctgggc 420 

aaggtaaata ggttgaacaa tcaataacat. tatccctgcc tgcatacatg tgaacaaaag 480 

ctatagagga catgcaaatt ctacagtcat tcctcatatg ctttagacag agtgcagcta 540 

ctggaatctt ccagatttca gtgctttaaa atcagagctc tgaatacaca aaaaaaaaaa 600 

gccctatagt gagtcgtatt aca 623 



<210> 61 

<211> 376 

<212> DNA 

<213> Homo sapien 



<400> 61 

gcatgctcga gcggccgcca gtgtgatgga .tatctgcaga attcggctta gcggataaca 60 

atttcacaca ggatc.catga ctcagctatt aaggctctgg cctcggatcc ctatgaggaa 120 

tattttacca caggttcagc agaaggtaac ataaaggttc ggagattgac aggccatggc 18 0 

ctaattcatt catttaaaag tgaacatgct aagcagtcca tatttcgaaa catcggggct 240 

ggagtcatgc agattgacat catccagggc aatcggctct tctcctgtgg tgcagatggc 300 

acgctgaaaa ccagggttct gcccaatgct. tttaacatcc ctaacagaat tcttgacatt 360 

ctataaagat tggggt 376 



<210> 62 

<211> 539 

<212> DNA 

<213> Homo sapien 



<400> 62 

atgactcatt gtttctctgc ctttccgtgt gttacaggtg ggctgatccc cctgcagcca 60 

gtttcccata agcaactgac ttccaactgg gaatgtctcg ggggataatg ggggtgggga 120 

tatggaagta tagagaaaac ataagaaaat actgggtgta tacacctttc tctctctgag 180 

tatgatgaca atgtgatagt cagtgtggca tctgcgactc cagcttgtgc ctggcatgta 24 0 

caccctagct ccagcttccc ctgggagact gtgcatctcc tggctccact aacaccacct 3 00 

tcttctgacc ttccagccta gagatgatga ctctgccagc ctagatgggc tctgggttgt 360 

ctccctattc ctgtttgctt tgtagatttc ccattatgct gtcaccaact ccccagccta 420 

agccctctct attttaaatt ctcaagtgga ttatgttcct gattagtccc tgactgatat 480 

accactctcc tcatgatctc tgattagttt tcctgttagg ttgttgcagc aaaaaaaaa 539 



<210> 63 

<211> 304 

<212> DNA 

<213> Homo sapien 



<400> 63 

ggcttagcgg ataacaattt cacacaggac gactccaagc tgggaaggaa aattcccttt 



60 
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tccaacctgt atcaattttt acaacttttt tcctgaaagc agtttagtcc atactttgca 
ctgacatact ttttccttct gtgctaaggt aaggtatcca ccctcgatgc aatccacctt 
gtgt.tttctt agggtggaat gtgatgttca gcagcaaact tgcaacagac tggccttctg 
tttgttactt tcaaaaggcc cacatgatac aattagagaa ttcccaccgc acaaaaaaaa 
aaag 

<210> G4 
<211> 226 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1) . . . (226) 

<223> n ^ A,T,C or G 

<400> 64 

atgatgatga ccatgtggac agccaggact ccattgactc gaacgactct gatgatgtng 
atgacactga tgattctcac cagtctgatg agtctcacca ttctgatgaa tctgatgaac 
tggtcactga ttttcccncg gacctgccng caaccgaagt nttcactcca gttgtccccc 
cagtagacac ntntgatggc cgaggtgatg gtgtggttta tggact 

<210> 65 
<211> 225 
<212> DNA 
<2 1 3 > Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . , (225) 
<223> n A,T,C or G 

<400> 65 

taccaacaga gcttctgaaa cagataccat agcattggag agaaaaacag ctcacagtct 

gaggaagatg atattganag aaggaaagaa ttgaaagcat cttgaagaaa aactcagatt 

ggatntggga ttggtcaagt cggccggata atattccccc caaggagttc ctctttaaac 

acccgaagcg cacggccacc ctcagcatga ggaacacgag cgtca 

<210> 66 
<211> 240 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (240) 
<223> n = A,T,C or G 

<400> 66 

ccagcatggt ggccgtnatg gatagcgacc cacangcaag ctgggctttg aggaattcaa 6 0 

gtacttgtgg aacaacatca aaaggtggca ggccatatac aaacagtacg acactgaccg i20 

atcagggacc atgtgcagta gtgaactccc angtgccttt gaggcagcan ggttccacct 180 

gaatgaacan ctctataaca tgatcatccg acnctactca gatgaaagtg ggaacatgga 240 

<210> 67 



120 
180 
240 
300 
304 



60 
120 
180 
226 



60 
120 
180 
225 
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<211> 504 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_f eature 

<222> (1) . . . (504) 

<223> n = A,T,C or G 



<400> 67 

cacgaggaga gatngcatct gctatatatt ccacngatac atgtgagtna ctgatagaaa 60 

aaatcgcnnc ggngaacact gncaccggtn ccggcccccg gtactacagg gatctcntca 120 

gacttcaccg tntactacaa ngtaagcncc ctttaagaat gtcacggagt atgatgggca 180 

ggatgcctgc ggctccaaca nctggaacnt ggtggacgtg gacctcccgc ccaacaagga 240 

cntggagccc ggcatcttac tacatgggct gaanccctgg actcagtacg ccgtttacnt 300 

caaggctgtg accctcacca tggtggagaa cgaccatatc cgtggggcca agagtgagat 360 

cttgtncatt cgcnccantg cttcngttcc ttccnttccc ttggacnttc tttcggcatc 420 

aaactcctct tctcagtr.aa tcgtgaagtg gaaccctccc tctctgccca acggcnacct 480 

gagttactac tttgtgcnct ggca 504 



<210> 68 

<211> 462 

<212> DNA 

<2I3> Homo sapien 

<220> 

<221> misc__f eature 
<222> (1) . . . (462) 
<223> n ^ A,T,C or G 



<400> 68 

tggatggcag ggggagaaag gaaaagcaaa acactccagg acctctcccg gatctgtctc 60 

ctcctctagc cagcagtatg gacagctgga cccctgaact tcctctcctc ttacctgggc 120 

agagtgttgt ctctccccaa atttataaaa actaaaatgc atnccattcc tctgaaagca 180 

aaacaaattc ataattgagt gatattaaat anagaggttt tcggaagcag atctgtgaat 24 0 

atgaaataca tgtgcatatt tcattcccca ggcagacatt ttttagaaat caatacatgc 300 

cccaatattg gaaagacttg ttcttccacg gtgactacag tacatgctga agcgtgccgt 360 

ttcagccctc atttaattca atttgtaagt agcgcagcag cctctgtggg ggaggatagg 420 

ctgaaaaaaa aaaancccct tttttngtnt nttttaaaaa aa 462 



<210> 69 

<211> 357 

<212> DNA 

<213> Homo sapien 



<400> 69 

agaagtcttc ctgagccttc catgtatcct cggtgcccgg ggattaacca gcgttatcaa 60 

ccaaagctaa aggatgatga ggttgctcag ctcaagaaaa gtggagatac cctgtgggac 120 

atccagaagg acctaaaaga cctgtgacta gtgagctcta ggctgtagaa atttaaaaac 180 

tacaatgtat taactcgatc ctttagtttt catccatgta catggatcac agtttgcttt 240 

gatcttcttc aattgtgaat ttgggctcac agaatcaaag cctatgcttg gtttaatgct 300 

tgcaatctga gctcttgaac aaataaaatt aactattgta gtgtgaaaaa aaaaaaa 357 



<210> 70 
<211> 226 
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<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (226) 
<223> n = A,T,C or G 



<400> 70 

atgatgatga ccatgtggac agccaggact ccattgactc gaacgactct gatgatgtng 60 

atgacactga tgattctcac cagtctgatg agtctcacca ttctgatgaa tctgatgaac 120 

tggtcactga ttttcccncg gacctgccng caaccgaagt nttcactcca gttgtccccc 180 

cagtagacac ntntgatggc cgaggtgatg gtgtggttta tggact 226 



<210> 71 

<211> 477 

<:212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (477) 
<223> n - A,T,C or G 



<400> 71 

agcagacaag ccacaattaa catagggtac aattgggtca tgtagctcat gggaaatcca 60 

cagtcgtcaa agctatttct ggagttcata ctgtcaggtt caaaaatgaa ctagaaagaa 120 

atattacaat caagcttgga tatgctaatg ctaagattta taagcttgat gacccaagtc 180 

gccctcggcc agaatgttat agatcttgtg ggagcagtac acctgacgag tttcctacgg 240 

acattccagg gaccaaaggg aacttcagat tagtcagaca tgtttccttt gttgactgtc 300 

ctggccacna tattttgatg gctactatgc tgaacggtgc agcagtgatg gatgcagctc 360 

ttctgctgat agctggtaat gaatcttgcc ctcagcctca gacatcggaa acacctggct 420 

gctatagaag atcatgaaac tggaagccat attttgaatt ctacaaaata aaattga 477 



<210> 72 

<211> 374 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (374) 
<223> n = A,T,C or G 



<400> 72 

ccaagccaga ttgtcactcc agctgatctt ctttgatggt gaagaggctt ttcttcactg 60 

gtctcctcaa gattctctct atgggtctcg acacttaact gcaaagatgg catcgacccc 120 

gcacccacct ggagcgagag gcaccagcca actgcatggc atggatttat tggtcttatt 180 

ggatttgatt ggagctccaa acccaacgtt tcccaatttt tttccanact cagccaggtg 240 

gttcgaanga cttcaagcan ttgaacatga acttcatgaa ttgggtttgc tcaangatca 300 

ctctttggag gggcggtatt tccanaatta cagttatgga ggtgtgattc aggatgaccn 360 

ttttccattt ccaa 374 



<210> 73 
<211> 597 
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<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feat:ure 

<222> (1) (597) 

<223> n - A,T,C or G 



<400> 73 

ccaagggatc tgtaaagaat atatacttga gtggtgtgtg ttatcagata aagcaccctg 60 

tatcacagac tggcaacaag aagatggtac cgtgcatcgc acctatttaa gagggaactt 120 

agcagagagc aaatgctatt tgataacagt tactccagta tatgctgatg gaccaggaag 180 

ccctgaatcc ataaaggcat accttaaaca agctccacct tccaaaggac ctactgttcg 240 

gacaaaaaaa gtagggaaaa acgaagctgt cttanagtgg gaccaacttc ctgttgatgt 300 

tcanaatgga tttatcagaa attatactat attttatana accatcattg gaaatgaaac 360 

tgctgtgaat gtggattctt cccacacaga aatntacatt gtcctctttg actagtgaca 420 

cattgtacat ggtacgaatg gcagcataca cagatgaagg tgggaaggat ggtccaaaat 480 

tcactttcac taccccaaan tttgctcaag gganaaattg aagccatant cgtgcctgtt 540 

tgcttancat tcctattgac aactcttctg ggaatgctgt tctgctttaa taagcga 597 



<210> 74 

<211> 257 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (257) 
<223> n - A,T,C or G 



<400> 74 

tggtaaaggg taatagccag agnntagaac cttgangaga tgcggccaan gattctttat 
atctgaaccn agatgtnaaa naagaaaatg ctttgaggcc ttctaagcga tcctcctgtc 
taatttncac ctttgtctgg atgcacactt ctgaccncgc tgccacaacc tgtggggtct 
gatgtgtccc ttgatgggtg cggccctcag ggactgcacc ctgacaagtg ttnaggcaan 
attcctttct tgtgccc 



<210> 75 

<211> 330 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature 
<222> (1) . . . (330) 
<223> n = A,T,C or G 

<400> 75 

tgttcataag gctggtgata naggggtctt gtcatggaaa ggtgctcttc caggaaacct 60 

ctgtgtatgg aggtcgnagc cacaatacgc ggacgangat gtgaacacct acaatgccgc 12 0 

catcncttac accatcctca gccaagatcc tgagctccct gacnaaaata tgttcnccat 180 

taacaggaac gcaggagtca tcggtgtggt cnccactggg ctggaccgaa agagtttccc 240 

tacgtgtacc ntggtggttc aagcngctga ccttcanggt gaggggttaa tcacnacagc 300 

ancngctgtg atcacagtca ctgntaccaa 330 
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<210> 76 

<211> 387 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 

<222> (1) . . . (387) 

<223> n = A,T,C or G 



<400> 76 

gctcgcgcgc ctgcaggtcg acactagtgg atccaaagaa ttcggcacga gaacaacagt 6 0 

tatctccaag atgctattcg ttgaacccat cctggaggtt tccagcttgc cgacaaccaa 120 

ctcaacaacc aattcagcca ccaaaataac agctaatacc actgatgaac ccaccacaca 180 

acccaccaca gagcccacca cccaacccac catccaaccc acccaaccaa ctacccagct 24 0 

cccaacagat tctcctaccc agcccactac tgggtccttc tgcccaggac ctgttactct 300 

ctgctctgac ttgganantc attcaacana agccgtgttg ggggaagctt tggtaaattt 360 

ctccctgaag ctctaccacg ccttctc - 387 



<210> 77 

<211> 339 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> mi sc_f eature 
<222> (1) . . . (339) 
<223> n ^ A,T,C or G 

<400> 77 

ctgctgatcn gggtcccttt ggagcacaga tgatgcnatg gccancnngg gacaacnacg 60 

tgatctgcgc cctggtcctg gtgtccatnc tggccctcgg nancctggcc gaggcccana 12 0 

canagacgtg tncagtggcc ccccgtgaaa gacagaattg tggttttcct ggtgtcacac 18 0 

cctcccantg tgcaaataag ggctgctgtt tcgacaacac cgttcgtggg gtcccctggt 24 0 

gcttctatcc taataccntc nacntcccnc canaaaagga ntgtgaattt tanacacttc 300 

tgcagggatc tgcctgcatc ctgacgcngt gccgtcccc 339 

<210> 78 

<211> 385 

<212> DNA 

<213> Homo sapien 

<400> 78 

tcggtcatag ggagagattt gtatgctgta ctatgcagcg tttaaagtta gtgggttttg 6 0 

tgatttttgt attgaatatt gctgtctgtt acaaagtcag ttaaaggtac gttttaatat 120 

ttaagttatt ctatcttgga gataaaatct gtatgtgcaa ttcaccggta ttaccagttt 18O 

attatgtaaa caagagattt ggcatgacat gttctgtatg tttcagggaa aaatgtcttt 24 0 

aatgcttttt caagaactaa cacagttatt cctatactgg attttaggtc tctgaagaac 300 

tgctggtgtt taggaataag aatgtgcatg aagcctaaaa taccaagaaa gcttatactg 360 

aatttaagca aaaaaaaaaa acccc 395 

<210> 79 

<211> 307 

<212> DNA 

<213> Homo sapien 
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<220> 

<22l> misc_feature 

<222> (1) . . . (307) 

<223> n = A,T.C or G 



<400> 7 9 

tcgatacagg gatgtcagag ctgccagaga ctttatcctg aagctttacc aagatcagaa 60 

tcctgacaaa gnagaaagtc atctactctc acttcacatg tgctacagat acagacaata 120 

ttcgctttgt gtctgctgct gtcaaagaca caattctaca gctaanccta agggaattca 180 

accttgtcta aaagctgctg cccactcctc ccctataaca gaagatgtga tttgcaaact 240 

ccttgtttta tttgnaagtg cttctgacat cnccagagcc agccccatgc caggaactaa 300 

ggatgtc 3 07 



<210> 80 

<211> 528 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 

<222> (1) . . . (528) 

<223> n = A,T,C or G 



<400> 80 

gtcgatacag gaacagcatg tccaaatcga tgtggatgtt tccaagcctg acctcacggc 60 

tgccctgcgt gacgtacgtc agcaatatga aagtgtggct gccaagaacc tgcaggaggc 120 

agaagaatgg tacaaatcca agtttgctga cctctctgag gctgccaacc ggaacaatga 180 

cgccctgcgc caggcaaagc aggagtccac tgagtaccgg agacaggtgc agtccctcac 24 0 

ctgtgaagtg gatgccctta aaggaaccaa tgagtccctg gaacgccaga tgcgttgaaa 300 

tggaagagaa ctttgccgtt gaagctgcta actaccaaga eactattggc cgcctgcagg 360 

atgagattca gaatatgaag ganggaaatg gctcgtcacc ttcgtgaata ccaagacctg 420 

ctcaatgtta agatggccct tgacattgaa attgccacct acanggaact gctggangcn 480 

aagaaaacca ggatttctct gcctcctccn aacttttcct cccctgaa 528 



<210> 81 

<211> 369 

<212> DNA 

<213> Homo sapien 



<400> 81 

agcatggctc ccgaagtttt gccaaaacct cggatgcgtg gccttctggc caggcgtctg 60 

cgaaatcata tggctgtagc attcgtgcta tccctggggg ttgcagcttt gtataagttt 120 

cgtgtggctg atcaaagaaa gaaggcatac gcagatttct acagaaacta cgatgtcatg 180 

aaagattttg aggagatgag gaaggctggt atctttcaga gtgtaaagta atcttggaat 240 

ataaagaatt tcttcaggtt gaattaccta gaagtttgtc actgacttgt gttcctgaac 300 

tatgacacat gaatatgtgg gctaagaaat agttcctctt gataaataaa caattaacaa 360 

aaaaaaaaa 369 



<210> 82 

<211> 269 

<212> DNA 

<213> Homo sapien 



<220> 
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<221> misc_f eature 
<222> (1) . . . (269) 
<223> n = A,T,C or G 



<400> 82 

atgacaggga tgancaaact tngtctgggg tattgatgaa gatgacctac tgctgatgat 60 

accagtgctg ctgtaactga agaaatgcca. ccccttgaag gagatgacga cacatcacgc 120 

atggaagaag tagactaatc tctggctgag ggatgactta cctgttcagt actctacaat 180 

tcctctgata atatattttc aaggatgttt ttctttattt ttgttaatat taaaangtct 240 

gtntggnatg acaactnctt taaggggaa 269 



<210> 83 

<211> 196 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (196) 
<223> n = A,T,C or G 



<400> 83 

tttgggtcca attacagcta aagcaaaagt ggttattgaa ctgcttctat cggtctcggg 60 

nnttgctaaa ccttcccagg tgtattttgg aggtacagtt gttggcnagc aagctatnaa 120 

atctgaagat gaagtgggaa gttnaatana gtatgaatnc agggtaagaa actnaggtaa 180 

acctcnaata tncctc 196 



<210> 84 

<211> 448 

<212> DNA 

<:213> Homo sapien 

<220> 

<221> misc_feature 
<222> (1) . . . (448) 
<223> n = A,T,C or G 



<400> 84 

caaacatggg catggtgtca gcgataatgt ttntancagc tcccgacata aatcagtaan 60 

tnngatttcc accatatcna ncntcnggaa tttaaccntc aggagnagct cttnntcaga 120 

cnccctggaa aaacgagccc cattgnancc anctttgana cataaaacct ggagaaattc 180 

tccaatacng aaggtatana gcggggcatc gttgacagca tcacgggtca aaggcttctg 240 

gaggctcagg cctgcaaagg tggcatcatc cacccaacca cgggccagaa cctgtcnctt 300 

caggacgcag tctcccnggg tgtgattgac caagacatgg ccaccaggct gaagcctgct 360 

cagaaagcct tcataggctt cgagggtgtg aagggaaaga agaagatgtc agcagcagag 420 

gcagtgaaaa aaaaaaaacc cctatatt 448 



<210> 85 

<211> 169 

<212> DNA 

<213> Homo sapien 

<400> 85 

agcagaccaa ctgccttttg tgagaccttc ccctccctat ccccaacttt aaaggtgtga 60 
gagtattagg aaacatgagc agcatatggc ttttgatcag tttttcagtg gcagcatcca 120 
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atgaacaaga tcctacaagc 



tgtgcaggca aaaccragca ggaaaaaaa 



169 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification ^ : 

C12N 15A2, 15/62, C07K 14/47, 16/18, 
C12Q 1768, GQIN 33/53 



A3 



(11) International Publication Number: WO 00/22130 

(43) International Publication Date: 20 April 20(X) (20.04.lX)) 



(21) International Appiication Number: PCT/US99/24222 

(22) lotermttioaal KKng Date: U October 1999 (14.10.99) 



(30) Priority Data: 

60/104.351 
Not furnished 



15 October 1998 (15.10,98) US 
13 October 1999 (13.10.99) US 



(71) Applicant: CHIRON CORPORATION [US/USl; 4560 Horton 

Street, Emeryville. CA 94608 (US). 

(72) Inventor: GIESE. Klaus: Atiigcn Biotechnology GmbH. 

Roben-Rossic-Strasse 10, D-13125 Berlin (DE). 

(74) Agents: POTTER. Jane, E., R.; Seed and Bcr77 LLP. 6300 
Columbia. 701 Fifth Avenue, Seattle. WA 98104-7092 (US) 
ct ai. 



(81) Designated States: AE, AL. AM. AT, AU, A2, BA. BB. BG, 
BR, BY, CA, CH. CN. CR, CU, CZ, DE, DK. DM, EE. 
ES, FL GB. GD. GE, GH. GM HR, HU. ID, m IN, FS, IP, 
fCE. KG. KP. KR. JCZ, LC, LK, LR. LS, LT, LU, LV, MA. 
MD, MG, MK, MN, MW, MX, NO, NZ, PU PT, RO, RU. 
SD. SE. SG. SI. SK. SL. TJ. TM. TR, TT. UA, UG, UZ, 
VN, YU, ZA. ZW, ARIPO patent (GH. GM. KE. LS, MW. 
SD. SL, SZ, TZ, UG, ZW), Eurasian patent (AM, AZ. BY. 
KG, KZ. MD. RU, TJ, TM), European patent (AT. BE, CH, 
CY, DE. DK, ES. FI. FR, GB, GR, IE, IT, LU. MC. NL, 
PT, SE), OAPI patent (BF. BJ, CF. CG, CI. CM. GA, GN. 
GW. ML, MR. NE. SN, TD. TG). 



Published 

With international search report. 

(88) Date of publication of the international search report: 

5 October 2000(05.10.00) 



(54) Title: METASTATIC BREAST AND COLON CANCER REGULATED GENES 
(57) Abstract 

Gene sequences as shown in SEQ ID NOS: 1-85 have been found to be significantly associated with metastatic potential of cancer 
cells especially breast and colon cancer cells. Methods arc provided for detcnnining the risk of metastasis of a tumor, which involve 
detcmiining whether a tissue sample from a tumor expresses a polypeptide encoded by a gene as shown in SEQ ID NOS: 1-85, or a 
substantial portion thereof. 



JbNSCXDCID: <W0 0022130A3J 



FOR THE PURPOSES OF INFORMATION ONLY 

I 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 





Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


Fl 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


OA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia aixJ Hcr7xgi)vma 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Malt 


TT 


Trinidad and Tobago 


ej 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


DG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


MalaM/i 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Vici Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


C6le dMvoirc 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


FT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






I)K 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri L^ka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SC 


Singapore 







BNSDOCID <W0 0022130A3..L> 



INTERNATIONAL SEARCH REPORT 



intcm i«l AppliMtion No 

PCT/US 99/24222 



A. CLASSIFICATION OF SUBJECT MATTER 



IPC 7 C12N15/12 C12N15/52 C07K14/47 C07K15/18 
G01N33/53 

Aocordinq to IntemooonaJ Palvnt Ctaftsittcation (IPC) or to both national classification and IPC 



C1201/68 



B. FIELDS SEARCHED 



Mfnimum oooumvntuion M«rcn«a (cioasnicjiion »vst»m tallowed ay clossiticaijon iymoota) 

IPC 7 CG7K C12N C12Q GOIN 



nnnimanTTinii 



ammr tnon rrwrvnum docurnvmaDon to ttw •vtent mar sucrr a 



[ arv mcTuoM} m nte ttatctt svarcried 



E^tronic data Das* oonBune<J dunng tn» miernattonai searcn tn:inne ol d3L.i oaa« ana, wner* practical, searcn terms us*d| 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Caleqory ' Citatxin of docurn«nt. with indication. wner« oooroonate. of thv rf>levant ooasaqes 



R4*i«vant ts claim No 



DATABASE EMSTS [Online] 
E.M.B.L. Databases 
Accession Number: G21051, 
1 June 1995 (1996-G6-G1) 
HUDSON T: "Human STS WI -12648, sequence 
tagged site" 
XP002134106 
96.3% identity 
SeqIdNo.l 
abstract 



in 134 bp overlap wi th 



SCOTLANDI K ET AL: "Multidrug resistance 
and malignancy in human osteosarcoma" 
CANCER RES, 

vol. 56, no. 10, 15 May 1996 (1996-05-15), 
pages 2434-2439, XP0G21341Q5 



1-8, 

12-14,18 



furrttmr documents aim luted <n the continuatjon of box C 



□ 



Patent family memben are listed in annex 



Special categonei of ctteo documents 

A' document defining the generaJ ttaie of tne art wnich <s not 

considered to oe of oarbcula/' reievance 
E* eartier document but oublahed on or ahcr the international 

fiting dale 

*L' document which may throw doubts on onoritY ctojmfs) or 
wnich IS cfted to estaoltsn the publication dale of another 
utaDon or other soeciaJ reason (as specfiied) 

'O* document refemng to an oraJ disclosure, use, eihibrtion or 
otner means 

'P* docurrfent published onor to the tnternatjonal Ming date but 
later than the pnonry oate cuiirrwd 



T' later document published after th« intematwnat tiling date 
or pnonry date and not in conflict with the application but 
cited to understand the principle or theory underlying tne 
invention 

*X* document of D articular relevance, the claimed invention 
cannot be consioereo novel or cannot be considered to 
invotve an inventrve tteo when the document ts taken alone 

*Y* document ol particular relevance, the claimed invention 

cannot be consioered to involve an inventive step wnen the 
document is combined with one or more other such docu- 
ments, such combination oeing Obvious to a person sKjIled 
tn the art. 

document member of the some potent family 



Oate of tne actual oompieuon o( the international searcn 



28 March 2000 



□ate of mailing of the intemationai searcn report 



0 5. 



Name and mailing aoaress ot the ISA 

Euiooean Potent Office. P B 5818 Patentlaan 2 
NL - 2280 HV Riiswip^ 
Tel. (t3t-70) 3402040. T« 31 651 eoo ni. 
Fax: (♦Sl-TO) 340-3016 



Autnorued officer 



Lonnoy, 0 



Pom PCT/;SA/2lO (lacona •n«Bll raufy 1?92) 



ONSDOCID. <W0 0022130A3_L> 



page 1 of 2' 



INTERWATinMAi SEARrH REPOHT 

Interna il Application No 

PCT/US 99/24222 




C.(Coniinu*tion) DOCUMENTS CONSIDERED TO BE RELEVANT 




uatagory 


CrtJiUon OI oocum.m. wnn moicaiion.wn.r. •pproDrmt.. oi tn» r«»«v«nt o«m«9«» 


R«t*v«nt to ctaim No 




A 
A 


CARMECI ET AL: "Identification of a gene 
(GPR3G) with homology to the 
G-protein-coupled receptor superfamily 
associated with estrogen receptor 
expression in breast cancer" 
GENOMICS, US, ACADEMIC PRESS, SAN DIEGO, 
vol. 45, no. 3, 

1 November 1997 {1997-li-Oi} , pages 
607-617-17. XP002099963 
ISSN: 0888-7543 

RADINSKY ET AL: "Level and function of 
epidermal growth factor receptor predict 
the metastatic potential of human colon 
carcinoma cells" 

CLINICAL CANCER RESEARCH, US, THE AMERICAN 
ASSOCIATION FOR CANCER RESEARCH, 
vol. 1, no. 1, January 1995 (1995-01) 
pages 19-31-31, XP002099964 
ISSN: 1078-0432 

^ 


il 





;.pNS0OClD <WO {X;22130A3_t_> 



INTERNATIONAL SEARCH REPORT 



In. itional aoptxsation No. 

PCT/US 99/24222 



Box I Observations where certain claims were lound unsearch nbie (Continuation of item 1 of first sheet) 

Thra Intamational Search Report has not been established in respect of certain claims under Article l7(2)(a) lor the following reasons: 

t. I I Claims Noi.. 

they relate to sub|ect matter norreq u redto be searched by ths Authonty, nomefy: 



2. Claims Nos., ^ 

because they relate to ports of the International Application that do not comply with the prescnti^ed requirements to such 
an extent that no meanmgM International Search can be earned out, specifically; 



see FURTHER INFORMATION sheet PCT/ISA/210 



3. I I Claims Nos.. 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6, 4(a) 



Box H Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 
This tntemattonal Searching Authonty found multiple inventions in this international application, as lollows 



1 , j As all required additional search lees were timely paid by the aopltcant, this International Search Report covers at! 
' ' searchable claims. 

2. I I As all searcnaPle claims could be searched without effort justifying an additional tee, thrs Authority did not invite payment 

of any additional lee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' ' covers only those claims for which fees were paid, specifically clajms Nos . 



4 \w \ No required additional searcn lees were timely paid by the aopltcant. Consequently, this International Search Report is 
restncted to the invention first mentioned in the claims; it is covered by claims Nos... 

lo^ A •. C£a;^s -. ^--2, '^-'V c-d (ae£. ^aA-r;a£^^ 

Remark on Protest [ | The additional search fees were accompanied by the applicant's protest 

j I No protest accompanied the payment of additional search fees. 



Form PCT/lSA/210 (continuation of first sheet (!}) (July 1998) 
BNSDOCtD: <WO 0O2213OA3_Ly 



intemalional Application No. PCT/US 99/24222 



FURTHER INFORMATION CONTINUED FROM PCTHSAJ £10 



1. Invemiion 1: Claims: 1-8.12-14 and 18 (all partially) 
An isolated ami oartfied human protein 

which is at least 85% identical to an aimno aci d 5eou«« ay a 

S^leotide sequence cans.sting .i Secid(»o.l.or 'Xlll;^ from 

fusion protein coronsing at least six ""^^guousj^'"" f^f ^^^^^^^^ 
an amino acid sequence encoded by the f soenf caiiy hind 

the coTOiement thereof; A preparation °^^"^^bodies which ^P^^^^^^' '^^^'""^ 
to a human protein which cononses an amino acid ^edue"" «ncoaea oy tne 
^Sc eotiae sequence of SeqIdNo.l or the comulement 

for detecting metastatic tumor cells in a tissue ^'^I"' '^^^"^^ 

of n«asunng in said tissue sample an expression P"-"?"" .f,^ "^l^'' 
°:^,se. a coding -.eouencs of SealdNo.l. whersin a tissue sample which 
^^D^e'ses tne oroauct is categorized as containing metastatic tumor cells 
A Sd for determining metastatic potential in a tissue =^1^ ""^"^'"5 
thr teo of measuring an expression product of a gene -•'^'^^ ^^-^^^^^ 2^,, 
Squence of SeqIdNo.l. wherein a tissue samnle f "^^^"=1, J , 
is categorized as having metastatic potential; A J^^?"^^, "^"^one 

propensity for metastatic spread of a breast tumor P"f«=^ent ally " 
?un? comprising the step of measuring in a breast tumor ""^'^ ^j'^.^'"'^"" 
product of a gene which cororises a sequence consisting °<^,^'=^^f'°^^,-, . 
«herein a breast tumor sample which expresses the product is categorized as 
having a propensity to metastasize to bone or lung. 



2. 



inventions 2-35: Claims: 1-8.12-14.18 and 19 (all partially, as applicable) 
AS for invention 1. but respectively relating " =X"ia '^'"o' 

|-P,"--^-^32^ \r.Tls,\''2l: l9-lh\l .45. 48 .si. 54. 55. 

57. 53, 60 ana 63 

3. inventions 36-43: Claims: 6-3.12-14.18 and 19 (all partially, as applicable) 

AS for invention 1. but respectively '"e'^ting to one sequence selected from 
the grouD consisting of SeaIdNo.64. 65. 55. 59. 70, 71. 72. 73. 74. 75. 
80. 32 ano 83. 



4. Inventions 49: Claims: 1-5. 9-11. 15-17 (all partially) 

An isolated and purified human protein comprising an ^""i"" J^i,^ "2"^"" 
which is at least 85% identical to an amino acid sequence encoded by a 
ni Jot'oe sequence consisting of SeqIdHo.3.or the compiemen her o . A 
fusion protein comprising at least six contiguous amino 'JJ"" 
In mi no acid sequence encoded by the "f'^otide sequence °J Jeq dHo 3 or 
the comoiement thereof; A preparation of antibodies which ^0^^^^^"' "'""^ 
to a human protein which comprises an amino acid /""^^^^^ f^l 

nScleotice seouence of SeqidNo.3 or the compiefi^nt thereof; A method for 
Setec? no ,T«tastatic tumor cells in a tissue sample ""^"^^"^ the step of 
measuring in said tissue sample an expression product of a gene which 
Tomr'ltl a sequence consisting of Seqldflo.3. wherein a tissue sample which 
do« not express the product is categorized as = ^f 

detennining metastatic potential in a tissue sample comprising he °J 
^^LTing Tn a tissue samole an expression P-^-*- ^ p= ^"^ ^ 

a sequence of SeQldflo.3. wherein a tissue satrole which does not express 
product is categorized as having metastatic potential 

5. Inventions 50-75: Claims: 1-5. 9-11. 15-17 (all partially) 

A5 for invention 49. but respectively reiat:ng to one seouence selected 
from tne rrouD consisting af SeqldMo.7. 3. 10 12 15. 15 17 2 .i. 
25. 28. 31. 34. 37. 42. 43, 44, 46, 47. 49. 50, 51, 5j. 39. 51. 52 



BNSDOCID <W0 . 00221 30A3. 1 



International Application No. PCT/US 99/2422' 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 21G 



S. Invenc:ans 75-34: Claims: 9-Ll. 15-17 (all paniaily) 

AS for invention 49. but respectively relating to one seouence seiectefl 
from the group consisting of SeqIdNo.67. 68, 75. 77, 78. 79. 81. 84 ana 



ana 35 



7. Invention 85: Claim : 20 (totally) and 1-5 (all partially) 



An isolated and purified human protein comprising an amino acid seauencp 
which ,s at least 85% identical to an amino acid seauence encoaed a 
y^^^nn nrnt."""''"" "nsisting of SeQldfJo.SS or the cpimierent thereof- A 
fusion protein comprising at least six contiguous amino acids seiectedfrnm 
ZTaZt^VT^"". """'^"'^ nucleoside seauence of Seg dNo 56 or 

/h^^ ' thereof: A preparation of antibodies which specific™ v b?nd 
n.H.S "''i'^ comprises an amino acid seauence encoded by the 

"tl^rrl^t, ^^°"«"« Of SeqIdNo.56 or the complement thereof; A method of 
crpo o^ ™ '""''^•^"^^^y nstastatic spread of a colon :umo; coTO^ising .he 
hfrh rn * ""^^ ' « ^" expression croauct of a gL 

which comprises the nucleotide seauence shown ,n Seaidflo.=6 whercn a colon 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA; 210 



Continuation of Box 1.2 
Claims Mos.: 4 



Claim 4, which is directed to a fusion protein which comprises a first 
protein segment and a second protein segment fused to each other by means 
of a peptide bond, wherein the first protein segment consists of at least 
six contiguous amino acids selected from an amino acid sequence encoded 
by a nucleotide sequence of SeqIdNo.l or the complement thereof, 
encompasses an extremely large number of sequences. In view of that huge 
number, a meaningful complete search cannot be carried out. 
Furthermore, as none of the claimed sequences as defined in claim 4 is • 
disclosed in the application, the Search Division is provided with no 
guidance to carry out at least a meaningful partial search. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 56.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



■'i 



RNnDOCIO ■■WD nn22MnA-\ I - 



